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Adaptive  Finite  Element  Methods  with  Convergence 

Rates  * 

Peter  Binev,  Wolfgang  Dahmen,  Ron  DeVore 


Abstract 

Adaptive  Finite  Element  Methods  for  numerically  solving  elliptic  equations  are 
used  often  in  practice.  Only  recently  [12],  [17]  have  these  methods  been  shown 
to  converge.  However,  this  convergence  analysis  says  nothing  about  the  rates  of 
convergence  of  these  methods  and  therefore  does,  in  principle,  not  guarantee  yet 
any  numerical  advantages  of  adaptive  strategies  versus  non-adaptive  strategies.  The 
present  paper  modifies  the  adaptive  method  of  Morin,  Nochetto,  and  Siebert  [17] 
for  solving  the  Laplace  equation  with  piecewise  linear  elements  on  domains  in  M2 
by  adding  a  coarsening  step  and  proves  that  this  new  method  has  certain  optimal 
convergence  rates  in  the  energy  norm  (which  is  equivalent  to  the  H 1  norm).  Namely, 
it  is  shown  that  whenever  s  >  0  and  the  solution  u  is  such  that  for  each  n  >  1 ,  it  can 
be  approximated  to  accuracy  0{n~s )  in  the  energy  norm  by  a  continuous,  piecewise 
linear  function  on  a  triangulation  with  n  cells  (using  complete  knowledge  of  u ) ,  then 
the  adaptive  algorithm  constructs  an  approximation  of  the  same  type  with  the  same 
asymptotic  accuracy  while  using  only  information  gained  during  the  computational 
process.  Moreover,  the  number  of  arithmetic  computations  in  the  proposed  method 
is  also  of  order  0(n )  for  each  n  >  1.  The  construction  and  analysis  of  this  adaptive 
method  relies  on  the  theory  of  nonlinear  approximation. 

Key  Words:  elliptic  equations,  finite  element  methods,  adaptive  refinements,  rates  of 
convergence,  nonlinear  approximation. 

1  Introduction 

Adaptive  methods  are  frequently  used  to  numerically  compute  solutions  to  elliptic  equa¬ 
tions.  While  these  methods  have  been  shown  to  be  very  successful  computationally,  the 
theory  describing  the  advantages  of  such  methods  over  their  non-adaptive  counterparts 
is  still  not  complete.  For  example,  only  recently  [12],  [17]  have  there  even  been  proofs  of 
convergence  of  such  methods.  These  proofs  of  convergence  still  do  not  prove  any  guar¬ 
anteed  advantage  of  these  adaptive  methods  since  there  is  no  analysis  of  their  rate  of 
convergence  in  terms  of  the  number  of  degrees  of  freedom  or  the  number  of  computations. 

*This  work  has  been  supported  by  the  Office  of  Naval  Research  Contract  Nr.  N0014-91-J1343  , 
the  Army  Research  Office  Contract  Nr.  DAAD  19-02-1-0028,  the  National  Science  Foundation  Grants 
DMS0221642,  the  Deutsche  Forschungsgemeinschaft  grant  SFB  401,  the  TMR  Network  “Wavelets  in  Nu¬ 
merical  Simulation”  funded  by  the  European  Commission  and  the  Alexander  von  Humboldt  Foundation. 
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Recently,  an  analysis  of  rates  of  convergence  for  wavelet  based  adaptive  methods  was 
given  in  [6], [7].  These  papers  derive  an  adaptive  wavelet  based  algorithm  for  solving 
elliptic  problems  and  show  that  this  algorithm  has  optimal  efficiency  in  the  sense  that  if 
the  solution  u  can  be  approximated  (using  complete  knowledge  of  u)  in  the  energy  norm 
by  an  n-term  wavelet  expansion  to  accuracy  0(n~s),  n  — >■  oo,  then  the  adaptive  method 
will  do  the  same  using  only  knowledge  of  u  gained  through  the  adaptive  iteration.  Wavelet- 
methods  vary  from  their  FEM  counterparts  in  that  they  can  be  viewed  as  solving  linear 
systems  that  are  finite  sections  of  one  fixed  infinite  dimensional  matrix  problem  whose 
solution  gives  the  wavelet  coefficients  of  u. 

The  theoretical  foundation  of  Adaptive  Finite  Element  Methods  (AFEM)  is  less  sat¬ 
isfying.  There  is  no  known  algorithm  with  a  proven  rate  of  convergence  save  for  the 
univariate  case  [4],  The  purpose  of  the  present  paper  is  to  give  an  AFEM  and  prove 
convergence  rates  for  this  method  which  are  the  analogue  of  the  wavelet  case.  Our  al¬ 
gorithm  is  not  much  different  from  existing  adaptive  methods  based  on  bulk  chasing  of 
a  posteriori  error  estimators.  The  one  main  difference  is  the  utilization  of  a  coarsening 
strategy.  We  should  mention  that  coarsening  also  played  an  important  role  in  the  analysis 
of  adaptive  wavelet  methods.  However,  in  the  practical  implementation  of  the  adaptive 
wavelet  methods,  for  many  problems,  coarsening  is  not  needed.  The  same  may  be  the 
case  for  AFEM. 

We  primarily  view  the  present  paper  as  a  contribution  to  the  theoretical  analysis  of 
AFEM  rather  than  the  construction  of  an  adaptive  method  that  outperforms  other  adap¬ 
tive  methods  in  practice.  In  particular,  we  wish  to  clarify  whether  a-posteriori  information 
can  lead  to  an  adaptive  algorithm  that  exhibits  asymptotically  optimal  performance.  In 
spite  of  the  theoretical  emphasis  it  should  not  be  excluded  that  some  of  the  ideas  of  the 
present  paper  may  be  useful  in  practice.  One  of  these  tools  is  the  theory  of  nonlinear 
approximation  by  piecewise  polynomials.  Since  adaptive  methods  are  a  form  of  nonlin¬ 
ear  approximation,  this  theory  will  on  the  one  hand  help  us  to  provide  a  benchmark  for 
measuring  the  success  of  adaptive  methods,  and  on  the  other  hand,  provide  an  effective 
implementation  for  the  coarsening  (see  §4.5). 

Adaptive  Finite  Element  Methods  have  several  complications  that  make  their  analysis 
more  cumbersome.  These  include  the  need  for  graded  meshes,  the  problem  of  hanging 
nodes,  and  the  analysis  of  a-posteriori  error  estimators.  If  not  for  these  complications,  the 
analysis  in  this  paper  would  be  considerably  simplified.  In  order  to  present  the  ideas  of 
this  paper  in  their  simplest  form,  we  shall  try  to  minimize  these  obstacles.  In  particular, 
we  shall  restrict  ourselves  to  the  Poisson  problem 

—A u  =  f  in  fl,  u  =  0  on  dQ,  (1.1) 

where  If  is  a  polygonal  domain  in  El2  and  Oil  is  its  boundary.  We  shall  also  consider 
only  approximations  of  the  solution  u  by  piecewise  linear  elements  using  a  very  specific 
adaptive  refinement  strategy  (called  newest  vertex  bisection)  well-known  in  the  FEM 
literature.  In  this  way,  the  essentials  of  our  arguments  will  be  clear  and  we  can  also  call 
on  several  known  results  concerning  a-posteriori  error  estimates  that  can  be  found  in  the 
literature. 

We  conclude  this  introduction  by  briefly  describing  the  structure  of  this  paper.  In 
§2  we  discuss  the  general  form  of  adaptive  Finite  Element  Methods  which  is  marking, 


2 


subdivision,  and  completion  (to  remove  hanging  nodes).  We  then  consider  in  detail  the 
structure  of  subdivision  using  newest  vertex  bisection.  We  shall  introduce  a  simple  la¬ 
belling  for  edges  that  will  facilitate  the  analysis  of  this  type  of  subdivision.  The  main 
result  of  this  section  is  Theorem  2.4  which  bounds  the  number  of  cells  in  the  completion 
process  by  the  number  of  marked  cells.  This  bound  is  vital  in  proving  optimal  convergence 
rates.  §3  recalls  Galerkin  approximations. 

In  §4  we  study  adaptive  approximation  by  piecewise  linear  functions  on  adaptively 
generated  triangulations.  The  spirit  of  this  section  is  to  understand  how  to  construct 
good  adaptive  approximations  to  a  known  function  w.  In  particular,  we  introduce  the 
algorithm  of  [2]  which  will  be  used  heavily  in  our  adaptive  Finite  Element  Algorithm  for 
solving  (1.1).  Namely,  it  is  used  to  approximate  the  right  hand  side  /  (see  §4.4)  and 
for  our  coarsening  strategy  (see  §4.5).  We  also  discuss  in  this  section  what  are  optimal 
approximation  rates  to  a  known  function.  This  will  provide  a  benchmark  for  our  analysis 
of  adaptive  Finite  Element  algorithms. 

In  §5,  we  recall  the  adaptive  FEM  method  in  [17]  (we  call  it  the  MNS  algorithm),  and 
record  some  of  the  proven  facts  about  this  method  that  will  be  used  in  the  present  paper. 
This  includes  their  introduction  of  local  a-posteriori  error  estimators  and  their  analysis 
of  how  these  estimators  can  be  used  to  bound  global  errors.  In  §6,  we  make  some  minor 
modifications  of  the  MNS  algorithm  described  in  §5. 

In  §7,  we  describe  the  main  ingredients  of  our  new  adaptive  algorithm.  We  show 
its  optimal  rates  of  convergence  in  §8.  We  conclude  the  paper  with  an  appendix  which 
discusses  the  smoothness  conditions  that  govern  rates  of  convergence  by  adaptive  methods. 
These  results  are  not  important  for  the  analysis  in  the  present  paper  but  may  be  of  interest 
to  the  reader. 


2  Newest  vertex  bisection  and  completion 

This  section  has  three  purposes.  The  first  is  to  set  forward  some  of  the  notation  we  shall 
use  in  this  paper.  The  second  is  to  introduce  the  main  form  of  adaptive  Finite  Element 
Methods  which  is  marking,  subdividing,  and  completing.  The  third  is  to  introduce  and 
analyze  the  particular  form  of  subdivision  we  shall  use  in  this  paper,  the  so  called  newest 
vertex  bisection  method. 

Let  fl  be  a  polygonal  domain  in  Ft2.  We  shall  use  P  to  denote  a  partition  of  fl 
into  triangular  cells  A.  This  means  that  fl  =  UagpA  and  any  two  A,  A'  G  P  satisfy 
meas(A  fl  A')  =  0  where  here  and  later  in  this  paper  meas  denotes  the  Euclidean  measure 
in  Ft2.  Given  such  a  partition,  we  let  Sp  denote  the  space  of  continuous,  piecewise  linear 
functions  subordinate  to  P  which  vanish  on  Sfl.  A  function  S  is  in  Sp  if  and  only  if  S 
is  a  linear  function  on  each  A  G  P,  S'  is  continuous  on  fl,  and  S  vanishes  on  <9fl,  i.e. 
SP  C  P^fl). 

We  denote  by  Sp  the  set  of  edges  of  P  and  by  Sp  the  set  of  interior  edges.  Thus, 
E  G  Sp  means  that  E  is  an  edge  of  some  A  G  P  and  that  the  interior  of  E  is  in  the 
interior  of  fl.  All  other  edges  are  called  boundary  edges.  We  also  denote  by  Vp  the  set  of 
all  vertices  v  of  P  and  by  Vp  the  set  of  interior  vertices.  Thus,  v  G  Vp  means  that  v  is 
a  vertex  of  one  of  the  A  G  P  and  v  is  in  the  interior  of  fl.  All  other  vertices  are  called 
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boundary  vertices. 

There  are  two  special  conditions  that  we  shall  impose  on  a  partition  P  that  are  im¬ 
portant  in  Finite  Element  constructions.  First,  we  say  that  a  partition  satisfies  a  minimal 
angle  condition  if  for  each  A  G  P  all  of  its  angles  are  >  a0  for  some  positive  number  a0. 
Second,  we  shall  require  a  partition  P  to  be  conforming  which  means  that  the  intersec¬ 
tion  of  any  two  cells  is  either  empty  or  a  common  edge  or  a  common  vertex.  A  family  of 
partitions  whose  elements  are  all  conforming  and  which  satisfy  a  minimal  angle  condition 
with  respect  to  a  common  constant  a0  >  0,  is  called  admissible. 

The  uniform  minimal  angle  condition  implies  that  for  each  cell  A  in  any  partition 
P  from  an  admissible  family  'P .  the  ratio  of  the  radii  of  the  smallest  circumscribed  and 
the  largest  inscribed  circle  of  A  is  uniformly  bounded  independent  of  A  and  P.  This 
is  sometimes  referred  to  as  local  quasi-uniformity  or  shape  regularity.  In  particular,  this 
implies  the  existence  of  a  constant  C  =  C{V)  such  that 

1  <  diam  ( A)2/ 1 A |  <  C  for  all  A  G  P,  P  G  V ,  (2.1) 

where  |A|  =  meas  (A)  denotes  the  Lebesgue  measure  of  A.  Moreover,  there  exists  a 
constant  G0  =  G0(V )  such  that  for  any  two  cells  A,  A'  G  P,  for  which  A  fl  A'  ^  0,  we 
have 

diam(A)  <  Godiam(A/).  (2.2) 

Locally  quasi- uniform  partitions  allow  one  to  control  the  global  error  in  approximating  a 
function  by  local  errors.  Typical  estimates  in  FEM  depend  on  a0  and  deteriorate  if  a0  is 
small. 

Conformity  favors  common  finite  element  data  structures  by  conveniently  relating 
local  and  global  stiffness  matrices  since  global  basis  functions  are  composed  of  the  local 
shape  functions  on  each  element  in  a  simple  way.  In  our  specific  context  the  global  basis 
functions  will  be  the  Courant  “hat  functions”  (nodal  functions)  <f>v,  v  G  Vp.  The  function 
(j)v  is  the  unique  element  in  Sp  which  is  one  at  v  and  is  zero  at  all  other  vertices  in  VP. 
The  nodal  basis  functions  are  locally  supported  on  the  union  of  all  triangles  which  share 
v  as  a  vertex.  Moreover,  one  can  construct  locally  supported  dual  functionals  with  the 
same  supports  consisting  of  (discontinuous)  piecewise  linear  functions  thereby  giving  rise 
to  local  linear  projectors  on  Sp  that  are  bounded  in  /P(Q)  for  s  <  1  (in  fact  even  beyond 
1).  Again,  it  is  important  for  estimating  errors  that  for  any  partition  P  from  an  admissible 
family,  each  basis  function  is  overlapped  by  a  uniformly  bounded  number  of  other  basis 
functions  since  the  valence  of  the  vertices,  i.e.  the  number  of  edges  emanating  from  a 
given  vertex,  remains  uniformly  bounded. 

The  adaptive  procedures  we  shall  consider  in  this  paper  will  generate  a  family  of 
partitions  which  is  admissible. 

A  typical  AFEM  generates  a  sequence  of  partition  P0,  Pi, . . . ,  Pn  by  using  rules  for 
subdividing  triangles.  Given  the  partition  Pk,  the  algorithm  marks  certain  of  the  trian¬ 
gular  cells  A  G  Pt  for  subdivision.  We  shall  denote  by  A4k  the  collection  of  marked  cells. 
These  marked  cells  are  subdivided  using  certain  subdivision  rules.  This  process,  however, 
creates  hanging  nodes.  We  say  that  v  G  Vp  is  a  hanging  node  for  A  G  P  if  v  appears  in 
the  interior  of  one  of  the  sides  of  A.  Since  hanging  nodes  obviously  violate  conformity, 
in  a  second  step  a  certain  collection  M.'k  of  additional  cells  are  subdivided  in  order  to 
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guarantee  that  the  resulting  partition  Pk+i  is  admissible.  The  partition  Pn  is  the  final 
admissible  partition  associated  with  this  application  of  the  adaptive  algorithm. 

2.1  Newest  vertex  bisection 

We  shall  restrict  ourselves  in  this  paper  to  a  very  specific  method  of  subdivision  known  as 
newest  vertex  bisection.  We  shall  call  on  certain  properties  of  this  method  of  subdivision 
in  what  follows.  We  could  not  find  some  of  these  properties  (or  proofs  of  these  properties) 
in  the  FEM  literature  and  therefore  our  discussion  and  development  of  newest  vertex 
bisection  will  be  somewhat  lengthy.  The  book  of  Verfiirth  [22]  and  the  research  article  of 
Mitchell  [16]  describe  this  subdivision  method  and  give  some  of  its  properties. 

Given  an  initial  partition  P0  of  Q,  to  each  A  e  P0,  we  assign  exactly  one  of  its  vertices 
v(A)  as  the  newest  vertex  for  that  cell.  This  initial  labelling  can  be  made  in  an  arbitrary 
way.  The  edge  in  A  opposite  to  v(A)  will  be  denoted  by  E( A).  In  Figure  1  the  newest 
vertex  assigned  in  any  triangle  of  P0  is  indicated  by  an  arrow  pointing  to  E( A). 


Each  triangular  cell  that  arises  in  the  adaptive  process  will  also  have  exactly  one 
of  its  vertices  designated  as  a  newest  vertex.  If  this  cell  is  to  be  subdivided  then  the 
subdivision  is  a  simple  bisection  done  by  connecting  the  newest  vertex  and  the  side  E{ A) 
opposite.  Thus  the  cell  produces  two  new  cells  and  their  newest  vertex  (assigned  to  each 
new  triangular  cell)  is  by  definition  the  midpoint  of  E( A). 

The  partitions  which  arise  when  using  newest  vertex  bisection  satisfy  a  uniform  mini¬ 
mal  angle  condition.  This  is  established  by  showing  that  all  triangles  that  arise  in  newest 
vertex  bisection  can  be  classified  into  a  set  of  similarity  classes  depending  only  on  the 
initial  partition  P0  (see  Mitchell  [16]).  Also  note  that  if  a  partition  P  is  created  by  a 
sequence  of  newest  vertex  bisections  and  if  P  has  no  hanging  nodes,  then  it  is  conform¬ 
ing.  Thus,  it  is  admissible.  We  shall  show  in  the  next  subsection  how  any  given  partition 
generated  by  newest  vertex  bisection  can  be  completed  to  a  partition  with  no  hanging 
nodes  by  subdividing  certain  other  triangular  cells.  This  process  is  called  completion. 
Furthermore,  we  shall  bound  the  number  of  additional  subdivisions  necessary  to  remove 
hanging  nodes.  But  first  we  want  to  examine  another  important  property  of  newest  vertex 
bisection  which  is  its  tree  structure. 
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We  can  represent  newest  vertex  bisection  subdivision  by  an  infinite  binary  tree  T* 
(which  we  call  the  master  tree).  The  master  tree  T*  consists  of  all  triangular  cells  which 
can  be  obtained  by  a  sequence  of  subdivisions.  The  roots  of  the  master  tree  are  the 
triangular  cells  in  P0.  When  a  cell  A  is  subdivided,  it  produces  two  new  cells  which  are 
called  the  children  of  A  and  A  is  their  parent.  It  is  very  important  to  note  that,  no  matter 
how  a  cell  arises  in  a  subdivision  process,  its  associated  newest  vertex  is  unique  and  only 
depends  on  the  initial  assignment  of  newest  vertices  in  P0.  This  means  that  the  children  of 
A  are  uniquely  determined  and  do  not  depend  on  how  A  arose  in  the  subdivision  process, 
i.e.,  it  does  not  depend  on  the  preceding  sequence  of  subdivisions.  The  reason  for  this  is 
that  any  subdivision  only  assigns  newest  vertices  for  the  new  triangular  cells  produced  by 
the  subdivision  and  does  not  alter  any  previous  assignment.  It  follows  that  71  is  unique 
and  does  not  depend  at  all  on  the  order  of  subdivisions. 

The  generation  of  a  triangular  cell  A  is  the  number  g( A)  of  ancestors  it  has  in  the 
master  tree.  Thus  cells  in  P0  have  generation  0,  their  children  have  generation  1  and  so 
on.  The  generation  of  a  cell  is  also  the  number  of  subdivisions  necessary  to  create  this 
cell  from  its  corresponding  root  cell  in  P0. 

A  subtree  T  C  T*  is  a  collection  of  triangular  cells  A  E  T*  with  the  following  two 

properties:  (i)  whenever  A  E  T  then  its  sibling  is  also  in  the  tree;  (ii)  when  A  C  A'  are 

both  in  the  tree  then  each  triangular  cell  A  E  T*  with  A  C  A  C  A'  is  also  in  T .  The 

roots  of  T  are  all  the  cells  A  E  T  whose  parents  are  not  in  T.  We  say  that  T  is  proper  if 

it  has  the  same  roots  as  T*,  i.e.,  it  contains  all  A  E  Pq. 

If  T  C  T*  is  a  finite  subtree,  we  say  A  E  T  is  a  leaf  of  T  if  T  contains  none  of  the 
children  of  A.  We  denote  by  C(T)  the  collection  of  all  leaves  of  T. 

For  a  proper  subtree  T,  we  define  N(T )  to  be  the  number  of  subdivisions  made  to 
produce  T. 

Any  partition  P  =  Pn  which  is  obtained  by  the  application  of  an  adaptive  procedure 
based  on  newest  vertex  bisection  (such  as  the  algorithms  we  consider  in  this  paper)  can 
be  associated  to  a  proper  subtree  T  =  T(P)  of  T*  consisting  of  all  triangular  cells  that 
were  created  during  the  algorithm,  i.e.  all  of  the  cells  in  P0, ,  Pn .  The  set  of  leaves 
C(T )  form  the  final  partition  P  =  Pn . 

We  shall  say  that  T  =  T(P)  is  admissible  if  P  is  admissible.  We  denote  the  class  of  all 
proper  trees  by  T  and  all  admissible  trees  by  Ta .  We  also  let  Tn  be  the  set  of  all  proper 
trees  T  with  N(T)  =  n  and  by  Tf  the  corresponding  class  of  admissible  trees  from  Tn ■  We 
denote  by  V  the  class  of  all  partitions  P  that  can  be  generated  by  newest  vertex  bisection 
and  by  V°  the  set  of  all  admissible  partitions.  Similarly,  Vn  and  V*  are  the  subclasses 
of  those  partitions  that  are  obtained  from  P0  by  using  n  subdivisions.  There  is  a  precise 
identification  between  Vn  and  Tn.  Any  P  G  Vn  can  be  given  by  a  tree,  i.e.  P  =  P(T )  for 
some  T  E  Tn-  Conversely  any  T  G  T„  determines  a  P  =  P(T)  in  Tn.  The  same  can  be 
said  about  admissible  partitions  and  trees. 

2.2  Completion  of  subdivision 

The  adaptive  algorithms  we  consider  in  this  paper  will  be  of  the  following  type.  We  begin 
with  Pq  and  mark  certain  cells  in  P0  for  subdivision.  After  doing  these  subdivisions  we 
arrive  at  the  partition  P[.  This  partition  is  not  necessarily  admissible  and  so  we  shall 
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make  some  additional  subdivisions  which  will  complete  P[  to  an  admissible  partition  P\. 
We  then  repeat  this  process  of  marking  and  completing.  It  will  be  important  for  us  to 
see  that  the  completion  process  does  not  seriously  inflate  the  number  of  triangular  cells 
in  P„.  We  have  not  found  any  result  in  the  literature  saying  that  the  overall  number 
of  triangles  created  through  completion  always  stays  proportional  to  the  number  of  cells 
marked  throughout  the  refinement  process.  To  establish  this  will  be  a  bit  technical  and 
will  be  the  subject  of  this  subsection. 

Suppose  that  P  is  an  admissible  partition  with  #(P)  >  2  (the  case  #(P)  =  2  is  trivial 
in  what  follows).  To  each  A  G  P,  we  associate  a  triangular  cell  F( A)  G  P  as  follows.  Let 
v(A)  be  the  newest  vertex  of  A  and  E( A)  the  edge  of  A  opposite  to  v(A).  If  E( A)  is 
a  boundary  edge  then  we  define  F( A)  =  0.  Otherwise,  there  is  a  unique  triangular  cell 
A'  7^  A  which  has  E( A)  as  one  of  its  edges  and  we  define  F( A)  =  A'.  One  can  visualize 
the  mapping  A  — >■  F(  A)  as  a  flow  determined  by  the  vector  which  serves  to  bisect  A  in 
the  subdivision  process. 

By  a  chain  0(A)  (with  starting  cell  A)  in  P,  we  mean  a  sequence  A,  F( A), . . . ,  Fm( A) 
with  no  repetition  of  the  cells  in  this  chain  and  with  Fm+1(A)  =  Fk( A),  for  some  k  G 
{0, . . . ,  m  —  1}  or  Fm+1(A)  =  0. 

Remark:  We  shall  see  below  that  by  starting  with  a  particular  assignment  of  newest 
vertices  in  P0,  for  any  of  the  subsequent  partitions  P  =  Pk,  the  only  way  Fm+1( A)  = 
Fn( A)  is  for  n  to  be  equal  to  m  —  1.  We  shall  therefore  assume  this  property  in  going 
further. 

The  completion  of  a  chain  0(A)  is  a  collection  0(A)  of  cells  produced  by  two  sets 
of  subdivisions.  In  the  first  set,  each  cell  A'  =  Fk\ A)  in  this  chain  is  subdivided  using 
the  newest  vertex  bisection  (i.e.  the  insertion  of  the  line  segment  connecting  v(A')  to  the 
midpoint  on  F( A')).  This  subdivision  of  A'  produces  two  new  cells  (the  children  of  A'). 

After  this  first  set  of  subdivisions  has  been  completed,  there  will  generally  be  cells  with 
hanging  nodes.  The  second  part  of  the  subdivision  process  is  to  subdivide  each  of  the 
children  that  have  a  hanging  node.  Hanging  nodes  occur  inside  a  cell  A'  =  Fk{ A),  when 
E(Fk~1( A))  7^  E(Fk( A)).  In  this  case  the  new  edge  we  need  to  add  in  Fk( A)  is  the  one 
connecting  the  midpoints  of  these  two  edges.  This  part  of  the  subdivision  process  removes 
all  remaining  hanging  nodes.  By  the  above  remarks,  this  has  the  effect  of  subdividing 
(into  two  grandchildren)  the  child  of  Fk( A),  k  =  1, . . .  ,m', which  has  E(Fk~1( A))  as  an 
edge.  Here  m'  Mm  —  1,  when  Fm+1(A)  =  Fm_1( A),  and  m!  =  ?n,  when  Fm+1(A)  =  0. 

We  shall  make  some  further  observations  about  the  structure  of  (7(A)  and  the  resulting 
flow  structure.  For  this  purpose,  we  shall  introduce  a  way  of  labelling  all  edges  that  arise 
in  the  subdivision  process. 

We  shall  label  the  edges  in  the  partitions  p).  P\, ... .  P„  by  nonnegative  integers.  This 
labelling  will  give  us  a  simple  way  to  keep  track  of  the  subdivision  and  completion  process. 
Given  any  triangular  cell  A  in  one  of  these  partitions,  the  sides  of  A  will  be  labelled  by 
(i  +  l,i  +  1  ,i)  where  i  =  g( A)  is  the  nonnegative  integer  that  represents  the  generation 
of  A  (i.e.  how  many  subdivisions  of  a  cell  in  P0  were  needed  to  create  A).  The  labelling 
will  be  such  that  the  lowest  labelled  side  will  be  F( A),  i.e.  the  side  opposite  the  newest- 
vertex  of  A.  At  the  outset,  it  may  appear  that  the  labelling  of  a  side  will  depend  on  the 
triangle  A  and  so  a  side  will  get  two  labels  depending  on  which  triangle  we  view  it  to 
be  in.  However,  as  we  shall  see,  for  admissible  partitions  the  labelling  of  an  edge  can  be 
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independent  of  the  triangle  to  which  it  belongs  provided  we  start  with  a  suitable  labelling 

ofP0- 

To  start  the  labelling  process,  we  describe  how  to  label  the  edges  in  P0,  see  Figure 
2  which  displays  a  labelling  which  is  consistent  with  the  assignment  of  newest  vertices 
shown  in  Figure  1. 


We  begin  with  the  following  lemma. 

Lemma  2.1  For  any  initial  partition  Po  there  is  a  labelling  of  the  edges  in  Po  such  that 
each  edge  is  given  a  label  of  either  0  or  1  and  whenever  a  triangle  A  6  P0  then  exactly 
two  of  its  edges  are  labelled  with  a  1  and  the  other  edge  is  labelled  with  a  0. 

Proof:  We  shall  show  the  existence  of  such  a  labelling  by  using  the  theory  of  matchings 
in  cubic  graphs.  A  cubic  graph  is  one  in  which  every  vertex  of  the  graph  has  exactly 
three  edges.  Since  the  labelling  refers  only  to  the  combinatorial  structure  we  can  map 
the  domain  Q  covered  by  Po  homeomorphically  onto  a  portion  of  the  sphere  in  FI  ’  which 
induces  a  triangulation  of  that  portion  of  the  sphere.  Let  Po  denote  any  fixed  completion 
of  Po  to  a  triangulation  of  the  whole  sphere  and  consider  the  dual  graph  of  Po-  This  is  the 
graph  whose  vertices  are  the  triangles  of  Po-  Two  vertices  in  this  dual  graph  are  connected 
if  the  corresponding  triangles  share  an  edge.  This  graph  is  a  cubic  graph. 

Given  any  graph,  a  set  of  edges  £  of  the  graph  such  that  no  two  edges  from  £  share 
the  same  vertex  is  called  a  matching.  The  matching  is  perfect  if  all  vertices  are  covered. 
The  existence  of  perfect  matchings  in  cubic  graphs  has  been  established  by  Petersen  [20], 
see  also  Theorem  3.4.1  on  page  110  in  [15].  Given  such  a  perfect  matching  £  for  the  cubic 
graph  generated  by  P0  (as  described  above),  we  assign  a  label  0  to  each  edge  of  P0  which 
corresponds  to  an  edge  in  £.  Every  other  edge  of  P0  is  assigned  the  label  1.  By  Petersen’s 
theorem,  for  each  triangle  A  in  P0,  exactly  one  of  the  edges  of  A  will  be  labelled  0  and 
the  other  two  will  be  labelled  1.  Clearly  this  labelling  of  Pq  induces  a  labelling  with  the 
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desired  property  also  for  any  subset  of  Po  and,  in  particular,  also  for  P0  which  establishes 
the  claimed  existence.  © 

There  remains  the  problem  of  constructing  a  practical  scheme  for  giving  a  labelling 
of  the  form  expressed  in  Lemma  2.1.  One  way  is  to  employ  methods  from  combinatorial 
optimization.  By  the  above  proof,  it  suffices  to  construct  a  maximal  matching  for  the 
dual  graph  of  P0.  Such  schemes  can  be  found,  for  instance,  in  [15,  p.  238,  Theorem  10.6]. 

To  explain  a  simple  alternative  let  us  consider  yet  another  interpretation  of  the  la¬ 
belling  in  Lemma  2.1.  For  any  collection  A  of  triangular  cells  in  P0  we  denote  by  O4  their 
union.  Suppose  we  could  find  a  set  Q  C  P0  with  the  following  properties: 

(i)  All  triangles  in  Pq  \  Q  have  at  least  one  edge  on  the  boundary  of  fb 

(ii)  The  domain  Qq  can  be  decomposed  into  an  essentially  disjoint  union  of  quadrilaterals 
formed  by  pairs  of  adjacent  triangles  from  Q. 

Given  such  a  collection  Q ,  we  can  assign  the  following  labels  to  the  edges  of  P0.  For 
each  pair  of  triangles  from  Q  whose  union  forms  one  of  the  quadrilaterals,  we  assign  0  to 
their  common  edge  and  1  to  all  other  edges.  By  (i)  we  have  missed  at  most  edges  on  the 
boundary  T  of  Q.  If  such  an  edge  E  belongs  to  a  triangle  with  two  interior  edges,  they 
must  have  the  label  1,  so  we  assign  the  label  0  to  E.  If  the  edge  belongs  to  a  triangle 
with  two  edges  on  the  boundary  we  label  one  by  0  and  the  other  one  by  1  to  obtain  the 
type  of  labelling  asserted  by  the  lemma. 


So  the  labelling  problem  now  reduces  to  finding  a  collection  Q  in  P0  satisfying  (i)  and 
(ii).  Instead  of  finding  such  a  set  Q  for  P0  whose  existence  is  guaranteed  by  Petersen’s 
Theorem,  we  shall  mention  a  simple  way  of  constructing  such  a  Q  for  a  refinement  of  Lo¬ 
in  fact,  we  can  subdivide  each  A  in  P  into  four  triangles  such  that  the  new  partition  P'} 
consisting  of  these  new  triangles  satisfies  (i)  and  (ii)  for  Q  =  Pq.  Indeed,  given  A  6  P, 
we  use  a  bisection  to  divide  A  into  two  triangles  (the  side  chosen  for  this  bisection  can  be 
chosen  arbitrarily).  Each  of  the  two  resulting  triangles  ( they  will  be  our  ’’quadrilaterals” 
from  Qq)  are  then  again  subdivided  by  bisecting  the  side  the  triangle  has  in  common  with 
A  The  last  two  bisecting  edges  receive  labels  0,  and  all  the  other  edges  resulting  from  this 
procedure  receive  labels  1.  If  this  is  done  for  each  A  E  P,  the  resulting  partition  Pj  has 
no  hanging  nodes  since  all  the  edges  of  P0  are  divided  into  two.  Thus  Pj  can  be  labelled 
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as  described  above,  see  Figure  3. 

Given  the  labelling  of  sides  in  P0  by  Lemma  2.1,  we  define  the  newest  vertex  of  a 
triangular  cell  A  6  P0  to  be  the  vertex  opposite  the  side  which  is  labelled  by  0.  In  going 
further,  we  shall  always  assume  that  the  initial  labelling  of  newest  vertices  in  P0  has  been 
done  in  accordance  with  Lemma  2.1.  Notice  that  this  means  that  any  chain  in  P0  has  at 
most  two  cells  and  that  the  subdivision  of  these  cells  gives  an  admissible  partition  (i.e. 
there  is  no  need  to  go  to  the  second  subdivisions  which  generated  grandchildren).  This 
is  illustrated  in  Figure  4  where  one  triangle  (from  P0  which  is  labelled  with  an  “x“)  has 
been  marked  for  subdivision. 


We  now  give  a  rule  to  label  any  edges  that  arise  from  the  subdivision-completion 
process.  There  will  be  two  main  properties  of  this  labelling.  The  first  is  that  each 
triangular  cell  will  have  sides  with  labels  (2,2,2  —  1)  for  some  positive  integer  i.  The 
second  is  that  the  newest  vertex  for  this  cell  will  be  the  vertex  opposite  the  side  with 
lowest  label.  Certainly  the  edges  in  P0  have  such  a  labelling  as  we  have  just  shown. 

Suppose  that  we  have  such  a  labelling  for  the  edges  in  Pk  and  let  us  describe  how  to 
label  the  edges  in  Pk+ Suppose  that  a  triangular  cell  A  £  Pk  has  sides  which  have  been 
labelled  (i,  i,i  —  1)  and  the  newest  vertex  for  this  cell  is  the  one  opposite  the  side  labelled 
z  —  l.  When  this  cell  is  subdivided  (using  newest  vertex  bisection)  the  side  labelled  z  —  1  is 
bisected  and  we  label  each  of  the  two  new  sides  2  +  1.  We  also  label  the  bisector  by  i  +  1, 
i.e.  the  new  edge  connecting  the  newest  vertex  of  A  with  the  midpoint  of  the  edge  E(  A) 
labelled  by  i  —  1.  Thus  each  new  triangle  now  has  sides  labelled  (i,  i  +  1,  i  +  1)  with  the 
newest  vertex  opposite  the  side  with  the  lowest  label.  We  note  the  important  fact  that  if 
a  cell  has  label  (2  +  1,2  +  1,2)  then  it  is  of  generation  i  (i.e.  it  has  been  obtained  from  a 
cell  in  P0  by  i  subdivisions).  Therefore,  specifying  that  the  generation  of  the  cell  is  i  is 
the  same  as  specifying  that  its  label  is  (2  +  1,  i  +  1, 2).  Figure  5  gives  an  example  for  the 
completion  process  which  corresponds  to  marking  one  of  the  triangles  of  P\  in  Figure  4 
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which  is  labelled  with  an  “x”  in  Figure  5. 


Figure  5:  Assignment  of  newest  vertices  in  P0 


Lemma  2.2  Suppose  P0  is  an  arbitrary  partition  and  its  edges  and  newest  vertices  are 
labelled  in  accordance  with  Lemma  2.1.  Suppose  that  P\, ...  ,Pn  are  partitions  which  are 
generated  from  Pq  using  the  marking,  subdivision,  and  completion  process.  We  label  edges 
in  Pi, ...  ,Pn  as  described  above.  Then  there  holds  for  each  k  =  0, 1, . . . , n: 

(i)  each  edge  in  Pk  has  a  unique  label  independent  of  the  two  triangles  which  share  this 
edge. 

(ii)  If  A  is  a  triangular  cell  in  Pk  of  generation  g( A)  =  i,  i.e.  the  edge  with  label  i  is  the 
side  shared  by  A  and  F(A),  then  g(F(A))  E  {i,i  —  1}.  If  g(F( A))  =  i  the  flow  ends  at 
F(  A). 

(in)  For  any  A  E  Pk  of  generation  g(  A)  =  i,  the  cells  in  its  chain 

(7(A)  =  {A,  F( A), . . . ,  Fm(A)}  have  the  property  that  g(FflA))  =  i  —  j,  j  =  0, . . . ,  m  —  1, 
and  the  terminating  cell  Fm(  A)  for  this  chain  is  either  of  generation  i  —  m  +  1  or  it  is  a 
boundary  cell  with  lowest  labelled  edge  an  edge  of  the  boundary. 

Proof:  This  is  proved  by  induction  on  k.  All  three  assertions  are  clear  for  k  =  0  by  the 
construction  of  the  labelling  of  P0  given  by  Lemma  2.F  Suppose  that  we  have  proven  the 
lemma  for  Pk-i  and  consider  Pk. 

Proof  of  (i):  Note  that  the  above  rules  leave  the  labels  of  all  those  edges  unchanged  that 
are  not  effected  by  subdivisions.  Any  edge  E  in  Pk  which  was  not  in  Pk-i  was  obtained 
in  one  of  two  ways.  The  first  is  that  it  is  a  new  edge  which  was  added  as  a  bisector  in  the 
subdivision-completion  process.  In  this  case  there  is  nothing  to  prove  about  its  labelling 
being  unique.  The  second  possibility  is  that  the  new  edge  E  was  obtained  by  bisecting 
an  edge  E' ,  say  with  label  i,  from  Pk-\.  Let  A  and  A'  be  the  two  triangular  cells  in  Pk-i 
which  shared  Eh  For  one  of  these  triangular  cells,  which  we  can  assume  is  A',  we  have 
E'  =  E(A').  So  A'  had  label  (i  +  1,  *  +  1,  i)  and  therefore  viewed  from  A',  E  is  assigned 
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the  label  %  +  2.  By  our  induction  assumption,  as  an  edge  of  A,  E'  is  also  labelled  by  i.  so 
that  A  has  either  the  label  (i,i  +  l,i  +  1)  or  ( i,i,i  —  1).  In  the  first  case  the  situation  is 
symmetric  to  A'  since  E'  =  E(  A)  is  bisected  and  E  as  one  of  the  halves  is  labelled  i  +  2. 
In  the  second  case  E'  is  still  an  edge  of  one  of  the  children  of  A  obtained  by  bisecting 
the  edge  with  label  i  —  1.  Thus  the  midpoint  of  E'  is  still  a  hanging  node  in  that  child 
which  is  now  labelled  ( i  +  1,  i  +  1,  i).  This  is  the  situation  described  by  the  first  case  which 
finishes  the  proof  of  (i). 

Proof  of  (ii):  Since  A  has  generation  i,  E( A)  has  label  i.  Since  this  edge  is  shared  with 
F(A),  we  conclude  that  the  sides  of  P( A)  have  either  labels  ( i ,  i,i  —  1)  or  (i  +  1,  i  +  1,  i). 
Also,  in  this  latter  case  P2( A)  =  A. 

Proof  of  (iii)  This  follows  from  (ii).  © 

Note  that  all  admissible  partitions,  generated  by  newest  vertex  bisection  based  on  an 
initial  labelling  according  to  Lemma  2.1,  are  graded  in  the  sense  that  any  two  cells  sharing 
an  edge  differ  in  generation  by  at  most  one. 


We  shall  next  give  a  bound  for  the  number  of  cells  in  Pn.  In  preparation  for  this, 
let  us  note  that  there  are  constants  c1}  C\  depending  only  on  P0  such  that  for  each  A  of 
generation  i ,  we  have  Ci2_*  <  |A|  <  Ci2~l  where  | A |  =  meas(A)  is  the  area  of  A.  Indeed, 
each  subdivision  of  a  cell  gives  two  cells  with  each  having  half  the  area  of  the  original  cell. 
By  adjusting  the  constants  if  necessary,  we  also  infer  from  (2.1)  that 

ci2“i/2  <  diam(A)  <  C'i2“i/2,  g{ A)  =  i,  (2.3) 

because  Pn  belongs  to  an  admissible  family  with  parameters  C.  Go,  ao  depending  only  on 
the  initial  partition  Po.1 

Using  (2.3),  we  can  bound  the  distance  between  any  two  cells  from  a  chain  (7(A).  Let 
the  cell  Fp( A)  be  of  generation  7  =  g{Fp( A))  and  let  Fq( A)  be  another  cell  from  (7(A), 
where  0  <  q  <  p  <  m.  Then  the  distance  between  these  two  cells  can  be  bounded  by 
the  sum  of  the  diameters  of  the  cells  in  the  chain  (7(A)  between  them.  According  to  (iii) 
from  Lemma  2.2,  the  generation  of  these  cells  decrease  exactly  by  one,  and  therefore 


p-q- 1 

dist(Pp(A),P«(A))  <  2“(7+l)/2  < 

i= 1 


y/2 

V2-1 


CV  2“7/2  . 


(2.4) 


Let  A4  :=  vU  Mj  be  the  collection  of  all  cells  that  were  marked  in  going  from  P0  to 
Pn.  Here  is  the  way  to  view  the  following  argument  that  will  bound  the  number  of  cells 
in  Pn.  We  give  each  cell  A'  G  M  a  fixed  number  C  >  0  of  dollars  to  spend.  These  cells 
will  spend  these  dollars  in  such  a  way  that  each  new  cell  that  was  created  in  going  from 
Po  to  P„  will  get  at  least  c  >  0  dollars  where  c  is  an  absolute  constant.  This  means,  we 
can  bound  the  number  of  new  cells  created  by  the  number  of  marked  cells. 

1  Generic  constants  whose  value  may  vary  on  each  occurrence  will  be  denoted  by  C.  Whenever  the 
specific  value  of  a  constant  matters  we  shall  use  subscripts.  Using  the  same  subscript  for  different 
constants  indicates  that  they  occur  in  the  same  type  of  estimates.  It  is  then  understood  that  they  may 
need  to  be  adjusted  at  each  occurrence. 
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We  now  describe  how  a  cell  A'  e  M  will  spend  its  money.  We  define 

OO 

A  :=  7C\  2~j/ 4  (2.5) 

j=- 1 

where  C\  is  the  constant  in  (2.3).  We  define  now  a  function  A  :  Pn  x  JA  — *  M  as  follows: 

(  (j  ~k  +  2)-2,  if  g(A')  =  j,  g( A)  =  k,  dist(A,  A')  <  A2~k/ 2 
A(A,  A')  :=  <  and  k  <  j  +  1,  (2.6) 

[  0,  otherwise. 

The  quantity  A(A,  A')  is  the  portion  of  money  which  is  spent  by  the  marked  cell  A'  on 
nearby  cells  A  E  Pn  of  generation  at  most  g{A')  +  1.  Given  A7  E  M  with  g( A7)  =  j, 
there  are  for  any  k  <  j  +  1  at  most  C'  cells  A  G  P„  of  generation  g( A)  =  k  which  satisfy 
dist(A,A7)  <  A2~k!2  (see  (2.3)),  where  C'  is  an  absolute  constant.  Hence  for  any  such 
A7  E  JA  one  has 

OO 

Y,  A(A,A')<C'y],/-2  =  C  (2.7) 

AeP„  v=i 

with  C  an  absolute  constant.  It  follows  then  from  (2.7)  that 

£  E  A(A,A')<C#(Afl,  (2.8) 

A  'EM  a  ePn 

i.e.  the  total  amount  of  money  spent  by  all  the  marked  cells  is  proportional  to  their 
number.  Conversely,  each  cell  in  Pn  receives  at  least  a  minimum  share  bounded  away 
from  zero  as  will  be  shown  next. 

Lemma  2.3  For  any  A  6  Pn  \  P0  we  have 

£  A(A.A')  >c  (2.9) 

A'e  M 

where  c  >  0  is  an  absolute  constant. 

Proof:  We  fix  A  and  let  k  :=  g( A)  >  1  be  its  generation.  We  are  going  to  define  a 
sequence  of  marked  cells  Ai,. . . ,  As  G  A4  associated  to  A  with  each  A,,  of  generation 
>  k  —  1.  Ai  is  the  marked  cell  such  that  A  E  C'(Ai),  i.e.  A  arose  by  subdividing  the 
chain  associated  to  Ax.  Given  that  A  j  has  been  defined,  we  let  AJ+1  E  Ai  be  the  marked 
cell  such  that  A  j  E  C'(Aj+1).  We  let  s  be  the  smallest  integer  such  that  g(  As)  =  k  —  1. 
Note  that  there  must  be  such  an  integer  because  subdividing  a  chain  can  only  increase 
the  highest  generation  in  the  chain  by  one.  For  each  j,  we  let  A;  denote  the  parent  of  A; 
from  the  chain  C(Aj+i),  i.e.  A  j  C  A  j. 

For  each  j  >  —1,  we  keep  a  running  count  m(i,j )  of  the  number  of  cells  A„  with 

=  j  +  k,  V  <  i. 

CASE  1:  There  is  a  j  E  {  —  1,  0, . . .}  such  that  m(i,j)  >  2 J’/4  for  some  1  <  i  <  s.  In  this 
case,  we  choose  j*  as  the  integer  which  has  the  smallest  i  (defined  to  be  i*  :=  i )  with  this 
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property.  In  other  words,  j*  is  the  integer  j  whose  count  first  exceeds  2-y/’4  and  i*  is  the 
smallest  i  for  which  m(i,j*)  >  2-T/4.  It  follows  from  (2.4)  and  g( A^)  —  g( A^)  <  2  that 


dist(A,  A„)  <  ^  C,  ^  2-(9(a")-2)/2  <  1CX  J2  2“9(A'i)/2  . 

*  /i=l  /i=l 

and  therefore  for  v  <  i* 

OO  OO 

dist(A,  A„)  <  7 Ci  J]  m{n,j)2~^/2  <  7C i2“fc/2  ^  2“j/4  =  A2“fc/2. 
j=-i  j=-i 


(2.10) 


(2.11) 


Thus,  for  each  Aj,,  z/  <  i*,  with  g(Av)  =  j*  +  k,  we  have  A(A,  Av)  =  ( j *  +  2)  2.  Since 
there  are  at  least  2J*/4  such  values  of  v,  we  obtain 


J]  A(A,  A')  >  {j*  +  2)~22j*/4  >  c  (2.12) 

A  'EM 


with  c  :=  minI/>_i(t'  +  2)~22UA  an  absolute  constant.  This  is  (2.9)  in  this  case. 

CASE  2:  In  this  case,  for  all  j  >  —1  we  have  m(s,j )  <  2J//4.  Therefore,  as  in  (2.11),  we 
have 

OO 

dist(A,  As)  <  7Ci2~k/2  ^  2~j/ 4  =  2l2“fc/2.  (2.13) 

3=~ 1 

This  means  A(A,  As)  =  1  and  again  we  have  (2.9).  © 


Theorem  2.4  Suppose  that  Pq,  . . .  ,Pn  is  a  sequence  of  partitions  generated  as  described 
above.  Then,  there  is  a  constant  C2  >  0  depending  only  on  P0  such  that 

<  #(Po)  +  C2(#(M0)  +  . . .  +  #(M„_i)).  (2.14) 

Proof:  It  follows  from  Lemma  2.3  and  (2.8)  that  the  number  of  new  cells  added  in  the 
subdivision  and  completion  process  does  not  exceed  C(#( A4o)  +  . . .  +  #(A4n_i))/c  and 
so  (2.14)  holds  with  C2  :=  C/c.  © 

We  shall  use  the  remainder  of  this  section  to  spell  out  different  settings  in  which  we 
shall  use  completion  and  formulate  variants  of  Theorem  2.4  for  these  settings. 

It  follows  from  Theorem  2.4  that  to  bound  the  cardinality  #(P)  of  a  partition  P  that 
arises  through  markings,  refinements,  and  then  completion,  we  need  only  keep  control  of 
the  number  of  markings  used  in  the  creation  of  P.  Given  a  partition  P  and  a  second 
partition  P'  that  is  obtained  from  P  by  first  marking  some  cells  for  subdivision  and 
then  doing  a  completion,  we  shall  use  the  notation  m(P'|P)  to  denote  the  number  of 
markings  that  were  used  to  go  from  P  to  P' .  Of  course,  there  may  be  many  ways  to  mark 
and  complete  to  go  from  P  to  P'  but  the  particular  method  (and  therefore  the  number 
m(P'|P))  will  always  be  clear  in  the  context  of  the  algorithm  under  discussion.  Thus, 
(2.14)  can  be  rewritten  as 

n 

#(P„)  <#(Po)  +  C2^m(Pfc|Pfc_1).  (2.15) 

k=  1 
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Up  until  now,  we  have  restricted  our  discussion  to  the  setting  where  a  completion  is 
done  after  each  set  of  markings  and  elementary  subdivisions.  We  can  also  apply  comple¬ 
tion  in  the  following  setting.  Suppose  P  is  an  admissible  partition  and  P  is  a  refinement 
of  P  which  is  not  necessarily  admissible.  We  complete  P  to  an  admissible  partition  P' 
by  a  sequence  of  markings  and  elementary  refinements  as  follows.  We  define  Ado  to  be 
the  set  of  all  A  E  P  which  are  not  in  P  (i.e.  the  cells  A  E  Ado  were  subdivided  in  going 
from  P  to  P).  We  perform  the  subdivisions  for  the  markings  Ado  and  then  form  the 
completion  to  arrive  at  the  admissible  partition  P1  which  is  a  refinement  of  P.  We  let 
Adi  denote  the  set  of  all  A  E  P\  that  are  further  refined  in  going  from  P  to  P  and  let 
P2  be  the  admissible  partition  obtained  after  marking  the  cells  Adi  and  then  completing. 
Continuing  in  this  way  we  arrive  at  the  first  admissible  partition  Pm  which  is  a  refinement 
of  P. 

Lemma  2.5  Let  P  be  any  admissible  partition  and  let  P  be  a  refinement  of  P.  Then, 
there  is  a  completion  P'  of  P  which  is  an  admissible  partition  that  refines  P  and  satisfies 

mn  <  #(P0)+C2(m(P\P0)+m(P\P))  <  #(P0)+C'2(#(P)-#(Po))  <  C2#(P).  (2.16) 

Proof:  Each  of  the  cells  in  Ado  U  Adi  U  •  •  •  U  Adm_i  was  subdivided  in  going  from  P  to 
P  and  hence  £ #(Mk)  =  m(P\P)  <  #(P)  -  #(P).  Also  m(P|P0)  <  #(P)  -  #(P0). 
Using  this  in  (2.15)  we  obtain  (2.16).  © 

3  Galerkin  approximations 

Numerical  methods  based  on  Galerkin  approximations  to  (1.1)  begin  with  the  weak  for¬ 
mulation  of  (1.1)  which  is  to  find  u  G  Hq(Q)  such  that 

a(u,w)  =  (f,w),  w  e  (3.1) 

where  a(y,w )  :=  (Vy,  Vio),  (y,w)  =  ( y,w)n  :=  fn  ywdx.  We  use  the  notation 

Mil2  :=  a(w,w)  =  ||Vw||i2(Q).  (3.2) 

By  Poincare’s  inequality  there  exists  a  constant  cq,  depending  on  fi,  such  that  for  any 
w  G 

cnMllHi(n)  <  <  M||Hi(n),  (3.3) 

where  M||^1(n)  =  ||w|||a(n)  +  || V«j||ia(n). 

Given  an  admissible  partition  P,  we  shall  denote  the  Galerkin  solution  to  (3.1)  by  up 
throughout  this  paper.  Thus  up  is  the  unique  element  in  Sp  C  Hq(Q)  which  satisfies 

a(up,w)  =  (f,w),  w  G  Sp.  (3.4) 

From  a  practical  point  of  view  it  would  be  better  in  (3.4)  to  replace  /  by  an  approximation 
and  consider  the  solution  of  this  modified  variational  problem  as  up.  But  we  want  to 
conform  exactly  to  the  algorithm  in  [17]  so  that  we  do  not  have  to  detour  through  a  new 
development  of  a  posteriori  error  analysis  in  this  modified  setting. 
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The  Galerkin  solution  up  is  a  best  approximation  to  u  from  Sp  in  the  energy  norm 

I  it  —  up  |  =  inf  |||u  —  S\\.  (3.5) 

seSp 

We  cannot  calculate  u p  exactly.  We  therefore  introduce  the  following  numerical 
scheme  in  which  the  constant  0  <  5  <  1  is  at  this  point  arbitrary  but  will  be  speci¬ 
fied  later. 

GAL:  This  algorithm  takes  as  input  an  admissible  partition  P ,  an  error  tolerance  p  and 
an  initial  approximation  up  G  Sp  to  up  that  satisfies 

I  it  —  ftp  I  <  A'p  (3.6) 

with  A'  a  fixed  constant.  It  applies  a  preconditioned  conjugate  gradient  scheme  with  initial 
guess  up  to  obtain  an  approximation  up  :  =  GAL(P,  /i,  Up)  to  up  that  satisfies 


Up  —  Up  <  Sp. 


(3.7) 


Remark  3.1  The  number  of  iterations  of  the  preconditioned  conjugate  gradient  scheme 
needed  to  achieve  (3.7)  depends  only  on  the  quotient  A' /8.  Since  each  iteration  requires  at 
most  Cjf(P)  computations,  it  follows  that  the  number  of  computations  iV(GAL,  P ,  p,  up) 
satisfies 

N(GAL,P,p,uP)  <  C3(A'/6)#(P),  (3.8) 

where  C3  :  t  — >■  C3(t)  increases  as  a  function  oft. 

Proof:  Since  |||ii  — «p|||  <  |||it  —  up\\  we  have  ||itp  —  f2p|||  <  2 A'p.  Thus  the  target  accuracy  in 
(3.7)  is  a  fixed  fraction  of  the  deviation  of  the  initial  guess  from  the  exact  Galerkin  solution 
Up.  The  BPX-scheme  allows  one  to  precondition  the  stiffness  matrices  corresponding  to 
nonuniformly  refined  meshes  in  such  a  way  that  the  condition  numbers  stay  uniformly 
bounded,  [11,  13,  18].  Since  the  error  reduction  in  one  iteration  of  the  conjugate  gradient 
scheme  then  reduces  the  current  error  by  a  multiplicative  factor  strictly  less  than  one, 
only  a  fixed  number  of  iterations  depending  only  on  A' / 5  is  needed  to  achieve  (3.7).  Each 
application  of  the  preconditioner  requires  only  Cff(P)  operations.  © 

We  shall  see  that  in  our  later  applications  of  GAL  the  constant  5  will  be  fixed  once 
and  for  all  and  that  there  exists  an  A'  such  that  (3.6)  is  satisfied  in  each  application. 


4  Adaptive  approximation 

In  this  section,  we  shall  discuss  adaptive  approximation  of  a  function  w  which  is  known 
to  us  and  for  which  we  can  compute  local  polynomial  approximants.  These  results  do 
not  apply  directly  to  u  since  it  is  unknown  but  they  serve  to  tell  us  what  is  the  best  we 
can  expect  in  terms  of  approximating  u.  We  shall  also  use  the  approximation  methods 
we  develop  in  this  section  in  parts  of  our  adaptive  algorithm,  namely  to  numerically 
approximate  the  right  hand  side  /  and  to  execute  our  coarsening  step. 
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We  limit  ourselves  to  adaptive  methods  based  on  subdivision  using  the  newest  vertex 
subdivision  rule  starting  with  an  initial  partition  P0  and  a  labelling  of  vertices  as  given 
in  Lemma  2.1.  We  recall  that  any  adaptively  generated  partition  P  can  be  associated  to 
a  tree  T(P )  which  is  a  proper  subtree  of  T*.  The  leaves  of  T(P )  give  the  partition  P. 
Conversely,  any  finite  proper  subtree  T  gives  a  partition  P  consisting  of  the  leaves  of  T. 

Recall  that  Vn  denotes  the  set  of  all  adaptively  generated  partitions  which  are  obtained 
from  P0  by  applying  at  most  n  elementary  subdivisions.  P0  consists  of  the  single  partition 
Pq.  The  partitions  in  Vn  correspond  to  trees  T  =  T(P )  which  satisfy  N(T)  =  n.  Similarly, 
we  denote  by  V"  all  the  partitions  in  Vn  which  are  admissible.  We  also  recall  our  notation 
Sp  of  continuous  piecewise  linear  functions  subordinate  to  P  which  vanish  on  <912. 

4.1  Adaptive  approximation  in  the  H1^)- norm 

Given  a  function  w  G  //(J  ( 12 )  and  a  partition  P,  we  define 

E(w,SP)Hi,n)  :=  inf  \\w  -  S||Hi(n),  (4.1) 

seSp 

which  is  the  smallest  error  we  can  achieve  by  approximating  w  in  the  IP  (12)  norm  by  the 
elements  of  Sp.  In  the  case  that  w  =  u  is  our  solution  to  (1.1)  and  P  is  admissible  then 
in  view  of  (3.3) 

I u  —  up  I  <  E(u,SP)m{n)  <  cq1||m  —  up  I,  (4.2) 

where  up  is  the  Galerkin  approximation  associated  to  P. 

Returning  to  the  general  case  of  a  w  G  II(\  ( 12 ) ,  we  enter  a  competition  over  all  partitions 
P  G  Vn  and  introduce  the  error 

crn(w)  :  =  inf  E(w,SP)H i(0)  (4.3) 

PeVn 

of  best  adaptive  approximation. 

It  is  unreasonable  to  expect  any  adaptive  algorithm  to  perform  exactly  the  same  as 
<rn(w).  However,  we  may  expect  the  same  asymptotic  behavior.  To  quantify  this,  we 
introduce  for  any  s  >  0,  the  class  As  :=  As{Hq(Q))  of  functions  w  G  Hq(  12)  such  that 

an( w)  <  Mn~s ,  n  =  1,  2, ... .  (4.4) 

The  smallest  M  for  which  (4.4)  is  satisfied  is  the  norm  in  As: 

IMUs  :=  SUP nsan(w).  (4.5) 

n>  1 

We  have  a  similar  measure  of  approximation  when  we  restrict  ourselves  to  admissible 
partitions.  Namely, 

cr“(w)  :=  inf  E(w,SP)H i(n)  (4.6) 

Pen 

now  measures  the  best  nonlinear  approximation  error  obtained  from  admissible  partitions 
and  As  :=  As(Pq(12))  consists  of  all  w  which  satisfy 

<  Mn~s ,  n  =  l,2, ....  (4.7) 

The  smallest  M  for  which  (4.7)  holds  serves  to  define  the  norm  ||ty||^s. 
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Remark  4.1  We  have  As(Po(12))  =  As(Hq  (12))  with  equivalent  norms. 

Indeed,  given  any  partition  P  G  Vn  (not  necessarily  admissible)  which  achieves  the  error 
E(w,Sp)Hi (Q)  =  an(w),  we  can  use  Lemma  2.5  to  complete  P  to  an  admissible  partition 
P1.  Since  #(P')  <  (72#(P)  (see  (2.16)),  and  the  error  on  the  partition  P'  does  not  exceed 
the  error  on  P  and  therefore 

VC2n(w)  <  °n{w)  (4.8) 

from  which  the  remark  easily  follows. 

The  reader  should  be  interested  to  know  what  functions  are  in  As.  It  turns  out 
that  these  classes  are  related  to  certain  Besov  spaces.  Since  we  do  not  use  or  need 
this  information  in  the  construction  of  the  algorithm,  we  postpone  this  discussion  to  the 
Appendix. 


4.2  Adaptive  approximation  in  H~\n) 


We  shall  also  need  approximation  by  piecewise  constants  which  will  be  our  vehicle  for 
resolving  the  right  hand  side  /  in  our  numerical  algorithm  for  (1.1).  The  approximation 
will  take  place  in  the  P_1(l 2)  norm  which  is  defined  for  a  tempered  distribution  g  by 
duality: 


sup 

<t>£H 1(0) 


1011  ’ 


(4.9) 


where  (-,  •)  denotes  the  duality  pairing  induced  by  the  standard  L2-inner  product.  Given 
a  partition  P,  we  let  Sp  denote  the  class  of  piecewise  constant  functions  subordinate  to 
P.  For  a  function  g  G  P_1(l 2),  we  have 


E(g,Sp)H-i^)  :  —  inf  \\g  —  Sr||H^?(n),  (4-10) 

ses° 

which  is  the  best  error  we  can  achieve  by  approximating  /  in  the  P-1(f2)-norm  by  elements 
of  Sp.  Analogously,  we  have 


Vn(g)H- J(n)  :=  inf  E(g,S°P)H- i(n) 

PePn 


(4.11) 


the  error  of  best  nonlinear  approximation  to  g  by  piecewise  constants. 

As  in  the  case  of  piecewise  linear  approximation,  we  introduce  for  any  s  >  0,  the 
approximation  class  AS(P_1(1 2))  and  its  norm  exactly  as  in  (4.4)  and  (4.5)  except  that 
we  use  crn(g) in  place  of  crn(w).  We  have  a  similar  measure  of  approximation  when 
we  restrict  ourselves  to  admissible  partitions.  We  denote  the  approximation  class  in  this 


case  by  As(P_1(f2)) 

Suppose  now  that  g  G  L2(f2).  Then  g  G  P_1(f 2).  If  P  is  any  partition  of  12  and 
A  G  P,  we  define 


(4.12) 
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which  is  the  average  of  g  over  A.  Also,  cja  is  the  best  approximation  to  g  in  L2( A)  by 
constant  functions.  Therefore, 


Sp{g)  :=  ^2  gAXA  (4.13) 

AeP 

is  the  best  L2(0)  approximation  to  g  by  piecewise  constants  subordinate  to  P. 

We  can  use  Sp(g)  also  to  approximate  g  in  the  //~'  (Q)-norin.  In  fact,  for  any  admis¬ 
sible  partition  P,  we  have  the  following  bound  for  the  approximation  error 

£(9,  S»  )?,-,(0)  <  ||9  -  S»(9)|li-,(n)  <  C0E(g,P)  (4.14) 

where  _ 

£(9,£)iie.^|A|||9-9a|||l(a). 

AeP 

Indeed,  to  see  that  this  is  true,  let  <t>  be  any  function  in  II'}  (Q)  of  unit  norm 
S°P  is  the  orthogonal  projection  to  Sp,  we  have 

(g  ~  S°P{g),  4>)  =  {g-  Sp(g),  0  -  S?>(0))  =  ^  [  (0  -  0a)(0  -  0a)-  (4.16) 

A  eP^A 

We  use  the  Cauchy-Schwartz  inequality  on  each  of  the  terms  in  the  last  sum  and  then  the 
Poincare  inequality  to  bound  such  a  term  by 

II 9  -  9a | |l2(A) ||0  -  0a I |l2(A)  <  C\A\1/2\\V(l)\\L2(A)\\g  -  9a\\l2(A)- 

Here  C  is  an  absolute  constant  because  all  the  triangular  cells  A  are  uniformly  shape 
regular  (2.1).  Using  this  in  (4.16)  and  again  applying  Cauchy- Schwarz,  we  arrive  at 
(4.14)  by  taking  a  supremum  over  all  (j)  of  norm  one  in  Hq  (U). 

We  can  use  E(g,P)  to  define  another  nonlinear  approximation  error: 

a2n(g):=  inf  E(g,P).  (4.17) 

PePn 

Analogous  to  the  approximation  classes  defined  above,  we  define  *4s(f7-1(f2))  using 
an(g)  and  the  norm  for  this  class  as  before.  When  dealing  with  admissible  partitions 
we  shall  denote  this  class  by  a4ls(i7_1(ff)).  As  before  ^ts(f7-1(f2))  =  *4s(f7-1(f!))  with 
equivalent  norms. 

4.3  An  algorithm  for  adaptive  approximation  of  a  given  function 

In  this  section,  we  wish  to  describe  some  of  the  results  of  [2]  which  give  adaptive  approxi¬ 
mation  algorithms  for  approximating  a  given  target  function  v  defined  on  Q  in  a  specified 
norm.  These  algorithms  are  different  from  an  AFEM  since  they  assume  that  the  target 
function  v  is  fully  known  (whereas  our  solution  u  to  (1.1)  is  not).  We  shall  use  these 
algorithms  in  two  different  settings  which  we  shall  describe  in  the  following  subsections. 

Although  the  algorithms  in  [2]  apply  in  a  more  general  setting,  we  shall  limit  our 
discussion  to  the  case  of  newest  vertex  bisection  and  its  associated  master  tree  T*.  The 


(4.15) 


Then,  since 
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starting  assumption  is  that  there  is  a  functional  e  which  associates  to  each  triangular  cell 
A  in  the  master  tree  a  nonnegative  real  number  e(A).  In  applications,  e(A)  is  usually 
some  local  approximation  error  (or  a  bound  for  this  local  error)  associated  to  A. 

There  are  two  algorithms,  called  Second  Algorithm  and  Modified  Second  Al¬ 
gorithm  in  [2],  that  we  shall  utilize.  We  shall  limit  our  discussion  in  this  section  to 
the  Second  Algorithm.  The  Modified  Second  Algorithm,  which  is  a  variant  of  the 
Second  Algorithm  will  bee  introduced  in  §4.5  when  we  shall  need  it  for  coarsening. 

The  Second  Algorithm  in  [2]  can  be  applied  under  the  assumption  that  e  satisfies 
the  following  property:  for  each  A  G  T*  and  each  subtree  T  which  contains  A  as  its  only 
root,  we  have 

Y1  «(A')  <  c«(A).  (4-18) 

a  'ezr(T) 

where  C  is  an  absolute  constant  and  C(T)  is  the  set  of  leaves  of  T.  Note  that  (4.18) 
follows,  in  particular,  when  e  is  subadditive,  i.e. , 

e(Ax)  +e(A2)  <  e(A).  (4.19) 

holds  for  any  A  G  T*  and  its  children  Ai  and  A2.  In  this  case,  C  =  1  in  (4.18).  We  shall 
use  the  Second  Algorithm  in  §4.4  when  we  approximate  the  right  hand  side  /  of  (1.1). 
For  any  proper  subtree  T  C  T„  we  define 

E(T):=  Y.  e<A)  <4'20> 

A  e£(T) 

as  the  error  associated  to  T.  In  applications  to  adaptive  partitioning,  E(T )  would  corre¬ 
spond  to  the  square  of  the  error  associated  to  the  partition  given  by  the  leaves  of  T. 

Recall  that  T0  :=  T(P0 )  and  that  for  any  tree  T  which  is  a  refinement  of  T0  the  number 
N(T)  is  the  number  of  subdivisions  necessary  to  create  T  from  T0;  it  is  also  the  number 
of  interior  nodes  in  T  starting  from  the  root  T0.  We  let  Tn{To)  be  the  set  of  all  trees  T 
with  N(T )  =  n. 

We  enter  the  following  competition  among  all  proper  trees  in  Tn(T0): 

En  :=  En(T0)  :  =  inf  E(T).  (4.21) 

TeT„(T0) 

So  En  is  the  smallest  error  we  could  achieve  using  trees  from  Tn{To).  Although  the  problem 
is  finite,  it  is  numerically  too  intensive  to  find  a  best  tree  which  achieves  the  minimum 
error  En  because  #(Tn{To))  is  exponential  in  n. 

Suppose  that  given  any  A,  we  are  able  to  compute  e(A).  We  would  like  to  find  a 
tree  which  performs  almost  as  well  as  the  best  tree  from  Tn{To )  and  to  do  so  while  only 
computing  0(n  +  #(T0))  values  e(A).  The  main  result  of  [2]  is  to  show  that  this  is 
possible.  Namely,  that  paper  gives  a  numerically  realizable  algorithm  which  yields  near 
best  trees  T  from  Tn{To )  by  which  we  mean 

E(T)  <  C0ECon  (4.22) 

where  >  0  are  absolute  constants  (in  particular  they  do  not  depend  on  the  starting 

partition  P0). 
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The  algorithm  finds  a  near  best  tree  for  the  given  value  of  n  by  creating  a  sequence 
of  trees  T  =  Tj,  j  =  1,  2, . . .  starting  from  the  initial  tree  T0.  Roughly  speaking,  at  any 
given  stage  in  the  algorithm,  it  computes  e(A)  for  all  leaves  in  the  current  tree  and  then 
subdivides  the  leaves  with  the  largest  e-value.  However,  to  avoid  a  lengthy  sequence  of 
subdivisions  with  insufficient  reduction  in  the  error,  the  algorithm  in  actuality  uses  a 
modified  functional  e.  What  is  important  in  the  context  of  the  present  paper  is  that  to 
create  the  tree  in  (4.22)  requires  the  computation  of  at  most  #(T0)  +  Cn  values  of  e  where 
T0  is  the  starting  tree. 

We  shall  use  this  algorithm  in  the  following  setting  described  in  [2]  as  the  Thresh¬ 
olding  Second  Algorithm.  Given  an  admissible  partition  P  which  is  not  necessarily 
Po  and  the  corresponding  tree  T(P),  let  now  T0  :=  T(P)  be  our  set  of  root  nodes.  For 
any  tree  T  which  is  a  refinement  of  T0  the  number  N(T\T0)  denotes  now  the  number  of 
subdivisions  necessary  to  create  T  from  T0.  So  N(T\T0)  =  N(T )  when  T0  =  T(P0).  Given 
a  tolerance  p  >  0,  and  an  admissible  partition  P  the  algorithm  produces  a  tree  T),  which 
is  a  refinement  of  T0  =  T(P )  with  the  following  properties  (see  Corollary  5.4  of  [2]): 

PI:  satisfies 

E(T,)  <  p.  (4.23) 

P2:  For  absolute  constants  C\  and  C\  (independent  of  P),  it  holds  that  whenever  T  is  a 
refinement  of  T(P )  satisfying 


E(T)  <  cip, 

(4.24) 

N(Th\T0)  <  CiN(T\T0). 

(4.25) 

Since  we  are  interested  in  admissible  partitions,  we  shall  add  a  completion  step  on  the 
end  of  this  algorithm  to  obtain  the  following  algorithm. 

Approximation  Algorithm  (AA):  Given  an  initial  partition  P ,  an  error  functional 
e  satisfying  (4-19),  and  an  error  tolerance  /a  >  0,  AA  produces  as  output  an  admissible 
partition  P'  =  AA (P,  p)  which  is  a  refinement  of  P  in  the  following  way.  First  it  uses 
the  Thresholding  Second  Algorithm  to  produce  a  tree  which  satisfies  PI  and  P2. 
We  can  write  =  T(P)  where  P  is  the  partition  formed  by  the  leaves  of  TM.  We  now 
apply  completion  to  P  as  described  in  Lemma  2.5  to  get  the  admissible  partition  P'  from 
P. 

The  following  lemma  describes  the  properties  of  this  algorithm. 

Lemma  4.2  The  output  P'  =  AA (P,  p)  satisfies 

E(T(P'))  <  p.  (4.26) 

In  addition,  there  are  absolute  constants  c1}  C\  >0  such  that  whenever  P  is  any  refinement 
of  P  which  satisfies  E(T(P))  <  C\  p,  then  we  have 

m{P'\P)  <ClN(T(P)\T(P)).  (4.27) 

Also,  the  number  of  evaluations  N(  AA,  P,  p)  of  e  needed  to  compute  the  output  P'  of  A  A 
satisfies 

N( AA,  P,  p)  <  2 (#(P))  <  2 #(P').  (4.28) 
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Proof:  The  subadditivity  property  (4.19)  guarantees  that  the  error  for  the  admissible  tree 
T(P')  is  smaller  than  that  of  T/; .  Hence  (4.26)  follows  from  (4.23).  If  P  is  any  refinement 
of  P  which  satisfies  E(T(P ))  <  C\ p,  then  by  virtue  of  (4.25), 

N{T{P)\T{P))  <  C1iV(r(P)|r(P)).  (4.29) 

That  is  the  number  of  subdivisions  needed  to  create  P  from  P  does  not  exceed  the 
right  side  of  (4.29).  By  definition,  m(P'|P)  =  N (T (P)\T (P)) .  Finally,  (4.28)  is  obvious 
since  the  only  values  of  e  that  need  to  be  computed  in  the  execution  of  the  Second 
Thresholding  Algorithm  are  those  on  the  nodes  of  T(P)  and  the  number  of  these 
nodes  does  not  exceed  2 #(P).  © 

4.4  Approximation  of  the  right  hand  side 

We  shall  describe  in  this  section  an  algorithm  for  approximating  the  right  hand  side  / 
whenever  /  £  L2(fl).  This  algorithm  will  be  a  step  in  our  adaptive  algorithm  for  solving 
(1.1).  We  shall  approximate  /  by  piecewise  constants  on  admissible  partitions  that  are 
adaptively  generated  using  newest  vertex  bisection.  To  achieve  this  approximation  we 
shall  use  the  adaptive  approximation  algorithm  AA  of  §4.3.  Let  P0  be  any  fixed  initial 
partition  and  let  T*  denote  the  master  tree  for  newest  vertex  bisection.  As  before,  we  let 
Va  denote  the  class  of  all  admissible  partitions  that  can  be  obtained  using  newest  vertex 
bisection  with  initial  partition  P0. 

For  each  A  £  T*,  we  define 

e(/,A)  =  e(A):=|A|||/-/A|||2(A)).  (4.30) 

The  subadditivity  property  (4.19)  follows  easily  from  the  fact  that  /a  is  the  best  L2( A) 
approximation  to  /  by  constants. 

The  following  statements  describe  our  algorithm  for  approximating  /  and  its  proper¬ 
ties: 

APPROX  (/,  P,  p)  — *  P'\  The  input  of  this  algorithm  is  the  function  f  £  L2(fi),  an 
admissible  partition  P  £  Va ,  and  the  error  tolerance  p  >  0.  The  algorithm  uses  the 
inputs  /,  P  and  p2  in  algorithm  AA  with  the  functional  e  defined  by  (4.30)  and  receives 
the  output  partition  P'  =  APPROX (/,  P,  p)  :=  AA(P,p2)  which,  by  the  properties  of 
AA,  is  admissible  and  satisfies: 

E(f,P')  =  E(T(P'))<p2.  (4.31) 

From  the  properties  of  (A A)  we  immediately  have  the  following  facts,  see  Lemma  4.2. 

Proposition  4.3  If  P  is  any  partition  which  is  a  refinement  of  P  and  satisfies  E(f ,  P )  < 

Cip2 ,  then 

m{P'\P)  <ClN{T{P)\T{P)),  (4.32) 

where  ci,  C\  are  the  constants  associated  to  AA.  The  number  of  evaluations  N( APPROX,  /,  P,  p) 
of  e  needed  to  compute  the  output  P'  to  APPROX  satisfies 

X( APPROX,  /,  P,  p)  <  2(#(P')).  (4.33) 
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The  following  lemma  describes  how  we  shall  use  APPROX. 

Lemma  4.4  If  we  apply  APPROX  to  an  input  function  f  6  As  =  As  (H~l  (fl))  ,  s  >  0, 
an  input  partition  P ,  and  an  input  tolerance  p  >  0,  then  the  output  partition  P'  = 
APPRO X(/,  P,  p)  satisfies 

m(P'\P)  <  CMWfW'iy-'1'  (4.34) 

with  C\  (,s)  a  constant  depending  only  on  s.  Also, 

#(P’)  <  C2(S)(#(P)  +  ll/llWP  (4.35) 

with  C*2  (s)  a  constant  depending  only  on  s. 

Proof:  Let  P*  be  the  smallest  partition  in  P  which  satisfies 

E(f,P*)<Clp2.  (4.36) 

where  c\  is  the  constant  appearing  in  algorithm  AA.  Then,  by  the  definition  of  the  class 
As,  we  have 

#(n  <  ll/llpv^)-17’.  (4.37) 

We  let  P  be  the  smallest  partition  which  is  a  common  refinement  of  P  and  P*.  Then,  P 
can  be  obtained  from  P  using  at  most  #(P*)  refinements.  Also,  E(f,P)  <  cip2  because 
P  is  a  refinement  of  P* .  Therefore,  the  output  partition  P'  of  APPROX  satisfies  (4.32) 
which  means  that 

m(P\P)  <  CiN(T(P)\T(P))  <  C, ||/|iy/(^)-1/*.  (4.38) 

This  proves  (4.34).  To  estimate  the  cardinality  of  P'  we  can  use  (2.16)  and  (4.37)  to 
obtain 

#(P')  <  #(P„)  +  C2(m(P\P0)  +  m(P’\P))  <  C2(#(P )  +  C.MII/llW P  (4'3£)) 
which  gives  (4.35).  © 

4.5  Coarsening 

In  this  section,  we  describe  our  second  application  of  the  adaptive  approximation  algo¬ 
rithm.  The  setting  is  the  following.  Suppose  that  P  is  any  admissible  partition  and  up  is 
the  Galerkin  solution  to  (1.1)  on  this  partition.  We  assume  that  we  have  a  function  4>(P) 
which  provides  an  upper  bound  for  the  Galerkin  error 

I  it  -  Mp|||2  <  4>(P).  (4.40) 

Such  an  upper  bound  <f>  is  given  in  §5  (see  (5.2)). 

The  algorithm  we  shall  construct  in  this  subsection  starts  with  the  input  of  two  ad¬ 
missible  partitions  P,  P'  with  P'  a  refinement  of  P,  and  a  tolerance  p  for  which  we  know 
the  bounds 

|||w  -  up |||2  <  $(P)  <  p2  (4.41) 


23 


and 


I u  —  Up'  |||2  <  $(P')  <  ap2.  (4.42) 

At  this  stage,  the  constant  0  <  a  <  1  is  arbitrary  but  fixed.  We  shall  specify  the  value  of 
a,  and  the  two  constants  [3  and  7  that  follow,  at  the  end  of  §7.  For  now  the  reader  should 
think  of  these  as  arbitrary  but  fixed  constants  satisfying  the  relations  we  specify  below. 

With  these  inputs,  we  shall  generate  an  admissible  partition  P  which  is  a  coarsening 
of  P'  such  that  a  certain  further  refinement  r(P )  satisfies 

If u  —  Mr(p)|||2  <  /3/j2,  (4.43) 

where  a  <  (3  <  1  .  The  main  property  (not  held  by  P')  we  will  gain  in  the  construction 
of  P  is  that  we  will  be  able  to  favorably  bound  its  number  of  elements.  The  reason  we 
need  the  refinement  r(P )  of  P  and  not  just  P  in  (4.43)  is  a  technical  one  that  will  be 
explained  later  in  this  section. 

To  find  P,  we  are  going  to  use  an  adaptive  approximation  algorithm  SAA  which  is 
a  modification  of  A  A  that  was  introduced  and  analyzed  in  [2],  The  algorithm  provides 
an  approximation  to  u pi  on  a  coarser  partition  (namely  P)  than  P' .  We  will  explain  the 
algorithm  SAA  in  more  detail  as  we  proceed. 

We  start  by  defining  an  error  functional  e  for  approximating  functions  w  G  II (\  ( Q )  by 
functions  from  Sp  with  P  an  admissible  partition.  To  do  this,  we  introduce  the  minimal 
ring  associated  to  a  triangular  cell  A  G  T*.  Given  any  admissible  partition  P  from  Va 
and  A  G  P,  we  define 

P(A,P):  =  U  (4.44) 

A'eP,AnAV0 

which  is  the  first  ring  about  A.  This  ring  depends  on  P.  However,  we  can  find  a  minimal 
ring  about  A  which  does  not  depend  on  P.  Namely,  we  define 

P_(A):=  p|  P(A,P)=  U  A'  (4.45) 

PePyAeP  A'eP_(A) 

where  P_(A)  is  the  collection  of  cells  from  T*  which  touch  A  and  make  up  P_( A)  . 

Before  returning  to  the  definition  of  e  that  we  will  use,  we  take  this  opportunity 
to  develop  a  little  further  the  properties  of  the  set  P_( A)  because  we  will  need  these 
properties  later  in  this  section.  Let  m0  be  the  maximal  valence  that  can  occur  for  any 
vertex  v  in  any  admissible  partition.  Because  of  the  uniform  boundedness  from  below  of 
the  angles  in  triangular  cells  for  admissible  partitions,  it  follows  that  mo  is  a  fixed  finite 
constant. 

Lemma  4.5  Suppose  P  is  an  admissible  partition  and  A  G  P  has  the  label  (k  + 1,  k+  f,  k). 
Any  cell  A '  G  P  such  that  A'  fl  A  7^  0  has  label  ( k  +  v,k  +  v,k  +  v  —  1)  where  —m°~2  < 

<  rnp  +  l 
u  —  2 

Proof:  Any  such  A'  shares  a  vertex  v  with  A.  This  vertex  is  common  to  two  edges  of 
A,  one,  which  we  denote  by  e0,  is  labelled  k  +  1,  and  the  other,  which  we  denote  by 
£1,  is  labelled  k,  or  k  +  1.  We  can  assume  that  all  other  edges  that  share  v  arise  when 
transversing  the  edges  from  £0  to  in  a  clockwise  direction.  We  claim  that  the  maximum 
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label  of  any  of  the  edges  containing  n  is  k  +  £  where  £  <  m°2+1 .  To  see  this  note  first  that 
the  labels  of  edges  that  share  a  cell  can  differ  by  at  most  1.  Now  suppose  that  the  number 
of  edges  incident  at  v  is  r.  Let  e  be  such  an  edge  with  label  k  +  £.  Stepping  from  e0  to 
e,  we  need  at  least  £  —  2  more  edges  between  these  two.  Moving  from  e  to  ey  at  least 
another  i  —  2  more  edges.  That  makes  a  total  count  of  at  least  3  +  2(£  —  2)  =  2£  —  1  <  r, 
and  therefore  £  <  Ly-.  This  confirms  the  upper  bound  for  v.  To  obtain  the  lower  bound, 
suppose  now  that  v  is  common  to  an  edge  e  with  label  k  —  £  and  that  again  r  edges  meet 
at  v.  It  takes  at  least  £  intermediate  edges  to  go  from  e0  to  e  and  at  least  £  —  1  edges  to 
go  from  e  to  £| .  Thus,  the  count  is  at  least  3  +  £+(£—  1)  =  2£  +  2  <  r  and  therefore 
£  <  This  means  that  £  <  m°~ 2  which  confirms  the  lower  bound.  © 


For  the  formulation  of  the  next  lemma,  we  introduce  the  notion  of  a  full  refinement. 
Given  an  admissible  partition  P,  by  a  full  refinement  of  P  we  mean  the  subdivision  of 
each  cell  in  P  into  six  cells  using  newest  vertex  bisection.  These  new  cells  consist  of  two 
grandchildren  of  A  and  four  great  grandchildren  of  A  and  result  in  the  bisection  of  each 
original  edge  of  A  as  well  as  the  bisection  of  the  new  edge  added  when  A  is  subdivided 
using  newest  vertex  bisection  (see  Figure  6). 

Lemma  4.6  Assume  that  P  is  any  admissible  partition  andr(P)  is  the  partition  obtained 
from  P  by  subjecting  each  cell  in  P  to  \nf-~\  full  refinements.  Then  r(P)  is  admissible  and 
any  ring  P(A',r(P))  of  a  cell  A'  G  r(P)  is  contained  in  the  minimal  ring  P_( A)  of  the 
ancestor  A  G  P  of  A'. 

Proof:  The  admissibility  of  r(P)  follows  from  the  fact  that  one  full  refinement  of  every 
cell  in  an  admissible  partition  leaves  no  hanging  nodes.  For  the  second  property  note 
that  a  full  refinement  to  a  triangular  cell  with  label  (k  +  1,  k  +  I,  k)  produces  cells  whose 
new  edges  carry  labels  that  are  at  least  k  +  2.  Thus  £  successive  full  refinements  produce 
triangles  with  generations  at  least  k  +  2£.  Next  note  that  the  minimal  ring  of  A  con¬ 
sists  of  triangles  touching  A  and  having  the  highest  possible  generation  admitted  by  T*. 
We  have  shown  in  Lemma  4.5  that  the  minimal  generation  of  cells  in  R(  A,  P)  is  greater 
than  k  —  m°2~2  —  1  =  k  —  'nf-.  Thus,  applying  full  refinements  the  smallest  possible 
generation  appearing  in  P(A',r(P))  will  increase  to  k  —  ^  +  2|’2^a]  >  k  +  This  is 
larger  than  the  highest  possible  generation  appearing  in  R(  A,  P)  which  by  Lemma  4.5  is 
at  most  k  +  m°2+1  —  1.  The  proof  is  complete.  © 
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We  now  return  to  our  problem  of  coarsening.  Given  a  function  w  E  Hq  (Q)  (later  we 
shall  take  w  as  an  approximation  to  up>),  we  define 

e(A)  :=  e(w,  A)  :=  inf  \\w  -  S1Ihi(jma))  =  Ik  “  Sa\\hi(r_(A)),  (4-46) 

where  the  infimum  is  taken  over  all  continuous  piecewise  linear  functions  S  defined  on 
R—(  A)  which  are  subordinate  to  P_( A)  and  vanish  on  dQ  fl  dR-(  A)  whenever  this  set  is 
not  empty. 

This  error  functional  e  will  not  satisfy  (4.18)  but  does  satisfy  the  following  weaker 
version  of  this  property: 

Weak  subadditivity:  For  any  A  E  T*  and  any  admissible  tree  T  which  has  A  as 
its  only  root  node,  we  have 

H  e(A')  <  Ce( A)  (4.47) 

A'S  C(T) 

where  C  is  an  absolute  constant  and  C(T)  is  the  set  of  leaves  ofT. 

Thus  the  only  distinction  in  this  weak  subadditivity  is  that  (4.47)  is  only  required  to 
hold  for  admissible  trees. 

Proposition  4.7  The  local  error  functional  e,  defined  by  (f-f6),  satisfies  the  weak  sub¬ 
additivity  property  (4-4V-  Moreover,  for  E  =  E(w,  •)  defined  by  (4-20)  for  this  choice  of 
e,  and  any  admissible  partition  P  G  Va  one  has  for  any  w  G  Hq  (Q) 

E(w,T(P ))  <  C4  inf  ||W  -  S\\2Hl(n),  (4.48) 

seSp 

where  the  constant  C'A  depends  only  on  the  initial  partition  Pq  which  consists  of  the  roots 

of  T*. 

Proof:  Let  T  be  a  finite  admissible  tree  with  single  root  A.  The  leaves  C(T)  of  T  form 
an  admissible  partition  of  A.  For  each  A'  E  £(T),  we  have  that  P_( A')  is  a  refinement 
of  P_(A)  on  the  set  P_(A'),  and  P_(  A')  C  P_(A).  Therefore, 

11'^  —  'S,A'||-ff1(-R_(A'))  ^  llw  —  'S,A||ri'1(i?_(A')),  (4.49) 

by  the  very  definition  of  S'a'- 

Let  us  observe  that  a  given  point  x  G  P_( A)  appears  in  at  most  C  of  the  sets  P_( A'), 
A'  G  £{T),  with  C  an  absolute  constant.  Indeed,  if  x  G  A*  with  A*  E  £{T),  then, 
because  of  the  minimality  of  the  rings,  x  will  not  appear  in  any  P_(  A')  unless  A'  touches 
P_( A*).  Since  the  partition  C{T)  is  admissible,  there  are  at  most  C  cells  which  touch 
P_( A*)  with  C  an  absolute  constant.  We  use  this  property  and  the  set  subadditivity  of 
||  •  |  |^-i  to  find, 

\\w  —  'S,A'||ifi(_R_(A'))  —  11^  —  'S,A||/fi(_R_(A'))  —  ^11^  —  'S,A||/fi(_R_(A))-  (4.50) 

a  'en(T)  A'en(T) 

This  inequality  verifies  property  (4.18)  for  this  choice  of  e. 
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As  for  (4.48),  let  Wp  G  Sp  be  the  minimizer  of  the  right  side  of  (4.48)  and  note  that 
only  a  finite  uniformly  bounded  number  of  the  minimal  rings  P_( A),  A  G  P,  overlap  at 
any  given  point  in  fl.  Hence, 

\\w  ~  ,S,A||/fi(_R_(A))  <  ^  11^  —  wp\\h1(R.(A))  <  C\\lV  —  Wp  | |lfi(o) , 

Ae p  A eP 

as  claimed.  © 

We  cannot  directly  reverse  the  inequality  (4.48)  which  is  why  we  introduced  the  re¬ 
finement  r(P)  which  will  allow  us  to  do  the  reversal.  It  is  obvious  that  r(P)  is  again 
admissible. 

Proposition  4.8  There  exists  a  constant  C4  depending  only  on  the  initial  partition  P0 
such  that  for  any  admissible  refinement  P  G  Va  of  P0,  one  has 

inf  lk-S||«.(I!)  <CtE(w,T(P)).  (4.51) 

sesr(p) 

Proof:  We  shall  employ  quasi-interpolants  for  bounding  the  best  approximation  in  the 
energy  norm.  To  this  end,  recall  that  the  Courant  elements  0V.  v  G  Vr(P),  form  a  basis 
for  <Sr(P).  Thus  each  S  G  Appj  has  the  unique  representation 

S=  S(v)^  =  J2  A-(S)<y,  (4.52) 

v  €Vr(P)  vtVr(P) 

where  the  <pv  are  normalized  to  have  norm  one  in  fP(fi),  i.e.  ||0„||_ffi(n)  =  1,  and  where 
the  A„  are  dual  functionals,  i.e.  A v(</>vi)  =  Svy.  This  means  that  the  A„  also  have  norm  one 
as  linear  functionals  on  <Sr(p)  when  this  space  is  equipped  with  the  fP(fl)  norm.  There  is 
a  norm  preserving  extension  A„  (which  we  continue  to  denote  by  A„ )  to  all  of  //,]  (Q)  and 
we  can  require  that  this  extension  is  given  as  the  integral  with  an  L->  function  A„: 

K(g)  =  [  gK,  (4.53) 

J  By 

with 

Bv  :=  supp(A„)  C  r(P)  :  v  G  A},  (4.54) 

the  star  of  v.  It  follows  that 

I  A,  (s)|  <  \\g\\m(Bv)-  (4.55) 

Clearly,  Qr(p){w )  :=  XLeV  (P)  K{w)fiv  defines  a  projector  from  L2(Tl)  into  Sypy  Let 
us  now  bound  ||rc  — Qr(p)(w)||_ffi(n).  Given  any  triangular  cell  A  G  P,  we  take  an  extension 
of  Sa  to  a  continuous  piecewise  linear  function  on  Q,  vanishing  on  dfl  which  is  subordinate 
to  r(P).  This  is  possible  since  P_( A)  is  contained  in  T(r(P)).  We  denote  this  extension 
also  by  Sa-  We  then  have  Qt(P){Sa )  =  Sa  so  that,  in  particular,  both  quantities  agree 
on  A.  Therefore  it  follows  that 

llW  —  Qr(P){w)\\Hi(A)  —  2||w  —  (SaIIp-^a)  +  2||Qr(P)('^’  —  'S'a)IIp'I(A) 

<  2e(A)  +  2|| Qr{P)(w  -  Sa)|&i(a)-  (4-56) 
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We  need  to  bound  the  second  term  in  (4.56).  Let  us  note  that  on  A  only  finitely  many 
terms  \v(w  —  S'a)^  are  nonzero.  These  correspond  to  the  vertices  from  r(P)  that  are  in 
A  and,  in  view  of  the  fact  that  each  full  refinement  subdivides  each  triangle  into  six  and 
introduces  four  new  vertices  on  it,  there  are  at  most  C  =  3  +  4  ^6^1  —  1  j  vertices  from 
r(P )  in  A.  Let  v  be  one  of  these  vertices,  then 

||A„(?n  —  <S'a)0«||#i(a)  —  11^  —  5a||#i(.b„)  <  \\w  —  5a||p-i(p_(a))  >  (4-57) 

where  we  have  used  that,  by  Lemma  4.6,  Bu  C  i?_(A),  v  E  An  Vr(p).  Hence,  summing 
over  all  vertices  v  G  Vr(p)  in  A,  we  obtain 

\\Qr{p){w  —  S/\) ||p-i(A)  A  C'e(A).  (4.58) 

Using  this  back  in  (4.56)  and  summing  over  all  A  G  P,  gives 

\\w  —  Qr(P){'^)\\H1(n)  —  ^4  E(w,T(P)).  (4.59) 

This  proves  the  assertion.  © 

Throughout  the  remainder  of  this  section  let  E  be  defined  by  (4.20)  for  e,  defined  by 
(4.46)  with  w  an  arbitrary  but  fixed  II 1  ( Q )  function.  We  return  now  to  our  problem  of 
generating  the  set  P.  For  this  we  shall  use  the  Modified  Second  Algorithm  of  §7  in 
[2] .  This  algorithm  exactly  matches  our  setting  of  newest  vertex  bisection.  We  recall  the 
properties  of  this  algorithm  (see  Corollary  7.4  of  [2]). 

Modified  Second  Algorithm  (MSA):  Given  any  function  w  G  and  any 

error  tolerance  /a  >  0,  MSA  produces  as  output  an  admissible  partition  P'  =  MSA(w,  ji) 
such  that 

E(w,  T(P'))  <  n.  (4.60) 

Moreover,  there  are  absolute  constants  Ci,Ci  >0  such  that  whenever  P  is  any  admissible 
partition  which  satisfies  E(w,T(P ))  <  C\  ji,  then  we  have 

#(P')  -  #(P0)  =  N(T(P'))  <  C\N (T(P))  =  CfiPiP)  -  #(P0)).  (4-61) 

The  number  of  evaluations  1V(MSA,  w,  ji)  of  e  to  compute  the  output  P'  to  MSA  satisfies 

N(MSA,w,ii)  <C3(#(P')).  (4.62) 

In  contrast  to  APPROX,  the  algorithm  MSA  always  starts  from  the  root  partition  P0. 
We  could  also  have  developed  an  algorithm  which  started  from  any  given  initial  partition 
P  but  this  would  be  at  the  expense  of  more  technicalities  which  we  wish  to  avoid. 

We  shall  use  the  algorithm  MSA  in  the  following  coarsening  algorithm  which  is  the 
main  algorithm  of  this  section.  The  constant  7  in  this  algorithm  will  be  specified  later. 
The  reader  should  think  of  7  as  any  fixed  constant  0  <  7  <  1  until  it  is  later  specified. 
The  algorithm  COARSE  will  also  use  GAL. 

COARSE:  We  take  as  inputs  the  admissible  partition  P' ,  the  tolerance  p  such  that  (4-42) 
holds,  and  an  initial  approximation  up<  G  Sp>  to  up>  satisfying  \\upi  —  up'\\2  <  (A'p)2. 
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The  algorithm  COARSE  outputs  the  admissible  partition  P  =  COARSE (P',p,up<)  as 
follows.  We  apply  GAL  to  obtain  the  numerical  approximation  upi  =  GAL(P',  p,  up>). 
We  take  w  =  upi  and  apply  MSA  with  inputs  w  and  ^p2  to  obtain  P  =  MSA  (w,^p2) 
where  7  will  be  specified  below.  It  follows  that 

E(uPi,T(P))  <  ^p2.  (4.63) 


The  rest  of  this  subsection  will  be  devoted  to  deriving  the  important  properties  of  the 
output  P  of  COARSE.  Let  us  first  observe  that  given  P',  the  number  of  computations 
necessary  to  invoke  the  algorithm,  is  bounded  by 

iV(COARSE,  P1,  p,  up 0  <  CfiA1 /8)#{P')  (4.64) 

with  C3(A' /5)  a  constant  depending  only  on  the  ratio  A'/8,  see  (3.6)  and  (3.7).  We  have 
already  observed  that  the  number  of  computations  to  calculate  up/  is  bounded  by  the 
right  side  of  (4.64)  (see  (3.8)).  To  compute  S'a,  A  E  P',  from  the  least  squares  problem 
uses  a  fixed  number  of  computations  because  up>  is  continuous  and  piecewise  linear  and 
the  number  of  cells  in  P_(  A)  has  an  absolute  bound  (see  our  discussion  of  the  structure 
of  P_( A)  given  above).  We  have  observed  in  (4.62)  that  the  number  of  evaluations  of  e 
needed  to  execute  MSA  is  bounded  by  C3ff(P').  the  estimate  (4.64)  follows  now  from 
Remark  3.1. 

Next,  we  want  to  bound  |||u  —  ur(pj|  for  the  Galerkin  solution  ur^py  To  do  this,  we  let 
S'  be  a  best  approximation  to  up/  from  <Sr(p)  in  the  P1(f2)  norm.  Thus, 

I'll  —  I  ^  |ii  —  S'!  ^  ||ii  —  tip/ 1  +  I  tip/  —  up/ 1  +  I  lip/  —  S'! 

<  (y/ap  +  8p)  +  1 1 up/  —  S'||pi(Q)  <  (y/a  +  8  +  \fC^y)p  <  \ffip  (4.65) 

where  in  the  third  inequality  we  used  (4.42)  on  the  first  term  and  we  used  (3.7)  for  the 
second  term.  Then  we  use  Poincare  (i.e.  (3.3))  followed  by  (4.51)  on  the  remaining  term. 
The  last  inequality  will  hold  provided  a,  5,  and  7  are  chosen  so  as  to  satisfy 

a  <  (3/9  and  8  <  and  C47  <  (3/9.  (4.66) 

We  impose  these  requirements  on  a,  8  and  7  in  what  follows. 

We  now  summarize  the  properties  of  the  output  P  of  the  algorithm  COARSE. 

Theorem  4.9  Given  as  inputs  the  admissible  partition  P1  ,an  input  tolerance  p,  and  an 
initial  approximation  upi  the  output  P  =COARSE(P',  /1,  up/ )  has  the  following  proper¬ 
ties: 

(i)  The  Galerkin  solution  ur(pj  onr(P)  satisfies: 

|||u  —  ur(p)|||2  <  (3p 2 .  (4.67) 

(ii)  The  number  of  computations  A(COARSE,  P',  p,  upi)  used  to  compute  P  satisfies 

N( COARSE,  P',  p , up/)  <  C3(Ai/8){#{P1))  (4.68) 
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with  C${A! / 8)  a  constant  depending  only  on  A' / 8. 
(in)  If  u  G  As  =  As{Hq  (fi)),  then 


#(P)  <  #(Po)  +  C2(s)\\u\\]';(H,m)V 


rl/s 


(4.69) 


with  C2(s)  a  constant  depending  only  on  s. 

Proof:  Statement  (i)  is  just  (4.65).  Statement  (ii)  follows  from  (4.64).  The  proof  of  (iii) 
is  similar  to  the  proof  in  Lemma  4.4.  Let  P*  be  the  smallest  admissible  partition  that 
satisfies 


I u  —  Up * |||2  ^  1 1 it  —  Up * ||jyi(Q)  ^  c2/i2  (4.70) 

where  c.\  is  a  constant  that  will  be  specified  later  in  this  proof.  /.From  the  fact  that  u  G  As, 
we  know 

#<n  <  iHi  (4.7i) 

Moreover,  note  that  up *  also  approximates  u,p>  well.  In  fact,  by  (4.42)  and  (3.7), 

|||up»  —  tip*  I  ^  If  iip»  —  it  HI  T  |||iipi  —  Up' I  T  HI  ii  —  Up*  I  ^  (\Zct  T  8  T  Ci)/i. 


Therefore,  from  (4.48)  and  (3.3),  we  obtain 

E(up>,T(P*))  <  C'4\\upl  -  up*  ||^i(n)  <  c^C'^sfa  +  8  +  Ci)2/i2  <  c^p2 
provided 

a  <  (CiTHm  and  <  WEPAl. 

9  ’  3 

We  can  now  apply  (4.61)  and  obtain 

#(P)  -  #(U)  <  C1(#(P*)  -  #(P„))  <  CrftP”)  <  CjfsJllull^V1^ 

where  the  last  inequality  uses  (4.71). 


(4.72) 

(4.73) 

(4.74) 

© 


Let  us  remark  on  the  order  that  all  of  the  above  constants  are  chosen.  First,  we  are 
free  to  choose  0  <  (3  <  1.  With  the  value  of  3  fixed,  we  now  choose  7  so  that  the  second 
inequality  in  (4.66)  is  satisfied.  Then  we  choose  a  so  that  both  the  first  inequality  in 
(4.66)  and  the  first  inequality  in  (4.73)  is  satisfied.  We  also  choose  c\  so  that  the  second 
inequality  in  (4.73)  is  satisfied.  Further,  we  require  that  5  satisfies  (4.73).  Later  in  §7,  we 
shall  fix  the  value  of  f3  and  then  all  other  constants  can  be  specified. 


5  The  adaptive  algorithm  of  Morin,  Nochetto,  and 
Siebert 

In  this  section,  we  shall  recall  the  AFEM  of  Morin,  Nochetto,  and  Siebert  [17]  which  is 
based  on  subdividing  (using  the  newest  vertex  rule)  certain  collections  of  marked  cells. 
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We  shall  later  modify  this  algorithm  to  arrive  at  our  new  algorithm.  Our  notation  is 
slightly  different  from  theirs  in  places. 

The  strategy  in  [17]  for  marking  cells  for  subdivision  and  the  guaranteed  error  reduc¬ 
tion  that  results  is  based  on  two  criteria.  The  first  of  these  is  a  local  error  indicator  that 
we  now  describe.  Suppose  that  P  E  Va  is  an  admissible  partition.  Given  an  edge  E  G  Sp, 
we  let  0-e  denote  the  union  of  the  two  triangles  that  share  E  and  let  £e  denote  the  length 
of  E.  We  also  denote  by  Je(P)  the  jump  of  n^'Vup  across  E.  where  for  each  cell  sharing 
E.  nE  is  the  respective  outer  normal  of  E.  The  quantity 

Ve  ■  =  Ve(P)  ■=  W^JeMWUe)  +  ||diam(flJS)/||£a(nj5),  (5.1) 

is  an  indicator  of  the  local  error  in  u  —  up.  One  can  show  that  the  sum  of  these  local 
indicators  bounds  the  square  of  the  global  energy  error  from  above.  More  precisely,  there 
exists  an  absolute  constant  A:>  such  that 


u-up  r  S 


<  A 


3  Y  Ve- 

Ee£p 


We  therefore  define  the  global  quantity 

$(P)  :=  A3  Y  Ve-  (5-2) 

E^Sp 

The  second  ingredient  that  we  shall  need  is  a  bound  for  the  global  H~ 1  error  in 
approximating  /  E  L2(Q)  by  piecewise  constants  in  terms  of  a  sum  of  local  quantities. 
We  have  already  introduced  these  quantities  in  §4.4.  We  recall  the  local  error  functional 

e(A):=|A|||/-/A||i2(A),  (5.3) 

where  /a  is  the  average  of  /  on  A.  We  also  have  the  global  error 

E(/,P):=y>(A),  (5.4) 

AeP 


which  was  used  in  algorithm  A  A. 

The  local  error  indicators  from  (5.1)  together  with  (5.4)  can  be  used  to  prove  the 
following  bounds  for  the  Galerkin  error  (see  [17]  equations  (2.7)  and  (2.8)): 

Ai$(P)  —  A2E(f,P)  <  I u-upf  <  $(P)  (5.5) 

where  AllA2>  0  are  absolute  constants. 

To  describe  the  main  result  in  [17],  we  denote  for  each  A  E  T*  by  G( A)  the  set  of 
six  triangular  cells  obtained  from  A  when  using  a  full  refinement  as  described  in  the 
paragraph  preceding  Lemma  4.6  (see  Figure  6).  The  following  theorem  is  Theorem  3.1  of 
[17]: 
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Theorem  5.1  Given  0  <  9  <  1,  there  exist  constants  0  <  r  <  1  and  A4  >  0  with  the 
following  property.  Let  P  be  any  admissible  partition  in  Va  and  suppose  that  £0  is  a 
collection  of  edges  from  £p  such  that 

^3  V e  >  0*(P),  (5.6) 

EGlSq 

and 

E(f,P)<A4p2.  (5.7) 

If  P'  is  any  admissible  refinement  of  P  that  contains  G( A)  or  a  refinement  of  G(  A)  for 
every  A  which  has  an  edge  in  common  with  £0,  then  either  \\u  —  upi  |||2  <  p2  or 

|||it  —  up'  |||2  <  r|||w  —  up |||2.  (5.8) 

In  other  words,  if  the  error  in  approximating  /  is  small  enough  then  the  refinement 
strategy  (5.6)  guarantees  an  error  reduction  until  the  desired  tolerance  p  is  met.  Using 
this  result,  Morin,  Nochetto,  and  Siebert  build  an  AFEM  which  they  prove  converges 
whenever  /  e  L2(£l). 

Given  a  partition  P  and  a  marked  set  £0  of  edges  satisfying  (5.6),  we  shall  say  that 
a  partition  P'  which  is  a  refinement  of  P  has  the  G'-property  if  it  is  admissible  and  P' 
contains  G( A)  or  a  refinement  of  G(A)  whenever  A  has  an  edge  in  £0. 

6  Modifications  of  the  MNS  Algorithm 

We  shall  make  some  modifications  of  the  MNS  algorithm  in  this  section  in  order  to 
prepare  for  its  use  in  our  new  algorithm  of  the  next  section.  These  modifications  will  center 
on  two  issues.  The  first  is  to  recast  the  algorithm  in  the  form  of  elementary  markings  and 
then  completions  since  it  is  in  this  form  that  we  can  apply  the  results  of  §2  which  bound 
the  cost  of  completions.  The  second  issue  is  more  substantial.  In  a  numerical  algorithm 
we  cannot  compute  up  or  (1>(P)  exactly.  We  shall  have  to  replace  these  computations  by 
approximate  computations  tip  and  'hfPj.  We  shall  need  to  examine  the  effect  of  these 
approximate  calculations.  We  shall  show  that  it  is  possible  to  calculate  these  quantities 
approximately  while  still  retaining  the  error  reduction  in  the  MNS  algorithm.  Moreover, 
we  will  be  able  to  bound  the  computational  complexity  of  these  approximate  calculations. 

To  begin  the  analysis  of  this  section,  we  first  note  some  properties  of  E  and  T.  First 
of  all  if  P'  is  a  refinement  of  P  then 

E(f,P')<E(f,P).  (6.1) 

This  follows  from  the  definition  of  E  (4.15).  Secondly, 

E(f,P)<(A1/2A2)$(P)  implies  $(P)  <  Asju  -  uPf  (6.2) 

with  A5  =  2/ A\.  This  follows  from  the  lower  inequality  in  (5.5). 

These  two  observations  will  be  used  several  times  in  the  following  form. 
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Remark  6.1  Assume  that  for  some  admissible  partition  P  we  have 

Ai 

j45 I'll  —  tip |||2  <  p2  and  E(f,P)  <  — -r~p2-  (6.3) 

2A2 

Then  for  any  admissible  refinement  P'  of  P  one  has 

$(P')  <  p2.  (6.4) 

Proof:  Suppose  that  <h(P')  >  p2.  Then,  by  (6.1), 

>  -  E(,'F)  -  ■ 

Thus  (6.2)  applies  and  since  P'  is  a  refinement  of  P  our  assumption  (6.3)  yields 

$(Pf)  ^  v4.5  HI  "it  —  'Up'|||2  ^  v4.5  III  "it  —  'iip|||2  A  p2 1 

which  is  a  contradiction  and  hence  proves  (6.4).  © 

Our  first  modification  of  MNS  which  we  shall  call  MMNS  is  to  recast  it  in  the  form 
of  marking  and  completion  and  still  have  the  property  that  the  refined  partition  contains 
G'(  A)  (we  need  this  to  be  able  to  obtain  the  guaranteed  reduction  in  Theorem  5.1).  Here 
is  a  description  of 

MMNS  (P,  p)  — *  P1:  This  algorithm  takes  as  input  an  admissible  partition  P,  the 
corresponding  Galerkin  approximation  up  and  an  error  tolerance  p  with  P  satisfying 
E(f,P)  <  A4p2  and  produces  a  new  partition  P1  =  MMNS(P,/i)  as  follows.  We  mark 
all  triangular  cells  which  have  an  edge  from  So  where  So  are  the  marked  edges  from  MNS. 
Call  this  marked  set  A4o-  We  perform  a  refinement  and  then  a  completion  for  A4o  and 
P  giving  the  new  partition  P[.  We  next  mark  any  cells  A'  in  P[  which  have  a  proper 
descendent  which  is  one  of  the  cells  in  G( A),  A  E  A40-  Call  this  set  of  marked  cells 
AA1.  We  do  a  refinement  and  completion  for  A4i  and  P[  giving  the  new  partition  Pfi  We 
repeat  this  process  one  more  time  ending  with  the  partition  P'  =  Pg . 

Here  are  the  properties  of  MMNS. 

Theorem  6.2  For  the  output  P1  of  MMNS  we  have  that  either  ||  u  —  up>  |||  <  p  or 
Ill'll  —  Up/ 1||2  <  t I -ii— up |||2.  The  number  of  subdivisions  A(MMNS,  P,  p)  needed  in  MMNS 
satisfies 

A(MMNS,  P,  p)  <  C3#(P),  (6.5) 

with  Co  an  absolute  constant. 

Proof:  For  each  cell  A  corresponding  to  a  marked  edge  in  MNS,  the  partition  P'  contains 
a  refinement  of  the  full  refinement  G( A).  Hence,  Theorem  5.1  applies  and  we  obtain  the 
stated  bounds  on  ||  ii  —  up:  |||.  As  for  (6.5),  the  number  of  cells  in  A4o  d°es  not  exceed 
ff{P)  and  hence  #(P()  <  (7#(P)  (see  (2.16)).  Repeating  this  argument  we  derive  (6.5). 
□ 
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Our  next  modifications  of  MNS  center  around  the  fact  that  we  cannot  compute  uP 
or  <f>(P)  exactly.  Therefore,  we  shall  use  the  approximation  uP  :=  GAL (P,n,uP).  We 
define  the  local  error  estimators  as  in  (5.1)  with  uP  in  place  of  uP  and  similarly  define 
$(P)  as  in  (5.2)  with  these  new  rjp  in  place  of  tje ■  A  standard  trace  argument  shows  that 

I Ve  ~  Ve |  <  B[\\uP  -  uP ||^i(nB),  (6.6) 

where  B[  is  some  fixed  constant  depending  only  on  P0.  As  a  consequence  one  has  for 
another  constant  B\  >  1 

A3  | fjE  —  t}e\  <  Bi52/i2.  (6.7) 

EeSp 

With  this  numerical  version  $  in  hand,  we  can  revisit  the  algorithm  MMNS  and  ob¬ 
tain  the  corresponding  numerical  version  of  this  algorithm  which  we  denote  by  NMMNS. 

NMMNS  (fjft'Up)  — *  P'\  This  algorithm  takes  as  input  an  admissible  partition  P,  an 
error  tolerance  /a  with  P  satisfying  E(f,P)  <  A^p2 ,  and  the  numerical  approximation 
Up  :  =  GAL(P,  p,  up),  and  produces  a  new  partition  P'  =  NMMNS(P,  p,  uP )  as  follows. 
We  let  £0  be  any  set  of  edges  in  £P  such  that 

A3  Y,  Ve  >  0*(P).  (6.8) 

Ee£q 

We  mark  all  triangular  cells  which  have  an  edge  from  £0  and  call  this  marked  set  Afo- 
We  perform  a  refinement  and  then  a  completion  for  Afo  and  P  giving  the  new  partition 
P[.  We  next  mark  any  cells  A'  in  P[  which  have  a  proper  descendent  which  is  one  of 
the  cells  in  G( A),  A  £  Afo-  Call  this  set  of  marked  cells  Afi-  We  do  a  refinement  and 
completion  for  Afi  and  P[  giving  the  new  partition  P'2.  We  repeat  this  process  one  more 
time  ending  with  the  partition  P'  =  P( . 

Here  are  the  properties  of  NMMNS. 

Theorem  6.3  If  S2  <  6(ABi)~l ,  then  for  the  output  P'  o/NMMNS  we  have  that  either 
||u  —  uP'  I  <  p  or  |||u  —  uP'  I2  <  r]|u  —  uP |||2  where  0  <  r  <  1,  depends  on  9.  The  number 
of  subdivision  and  computations  A(NMMNS,P,  p)  needed  in  NMMNS  satisfies 

A(NMMNS,  P,  n,  up)  <  C3(A'/5)#(P),  (6.9) 

whenever  the  initial  guess  up  satisfies  (3.6). 

Proof:  From  (6.7),  we  have  that 

0($(P)  -  B \52ii2)  <  6${P)  <AZY,  Ve  <  ^3  ^  Ve  +  B^p2.  (6.10) 

EGlS o  E(ElE o 


We  consider  two  cases: 

CASE  |||u  —  ttp I  <  p2:  Since  P'  is  a  refinement  of  P,  we  have  |||«  —  itp»|||  <  ||| u  —  «p|||  as 
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desired. 

CASE:  Hm  —  Up  I]  >  p2  In  this  case  d>(P)  >  p2  and  2  B452  <  6/2.  Therefore  (6.10)  gives 

(6/2 )<K(P)  <  A3  77 E.  (6.11) 

EeSo 

We  can  therefore  apply  Theorem  5.1  (with  6/2  used  in  place  of  6)  and  obtain  ||ii— up>  |||  <  p2 
or  I u  —  up'  I  <  r|||-ii  —  Tip  I  with  the  r  corresponding  to  6/2. 

The  bound  on  JV(NMMNS,  P,  71,  up)  follows  as  in  the  proof  of  (6.5)  with  the  caveat 
that  we  need  to  add  the  number  of  computations  used  in  GAL.  But  these  are  bounded 
by  C3(A' /S)ff(P)  because  of  (3.8)  in  Remark  3.1.  © 

The  following  algorithm  REDUCE  is  the  main  algorithm  of  this  section.  In  this 
algorithm,  there  appears  a  constant  A*  which  will  be  specified  later.  Its  role  is  to  make 
sure  that  the  term  E(f,  P')  appearing  in  (5.5)  is  negligible  compared  to  $(P')  for  all  of 
the  partitions  P'  that  are  encountered  in  the  execution  of  the  Algorithm. 


REDUCE  (P,  p,p',up)  — *  P'\  This  algorithm  takes  as  input  any  tolerances  0  <  p'  <  p, 
any  admissible  partition  P  for  which  we  know 

$(P)<7i2,  E(f,  P)  <  A*  p2  (6.12) 


and  any  initial  approximation  up  E  Sp  (to  be  used  in  GAL)  satisfying  (3.6).  It  outputs 
a  new  partition  P'  =  REDUCE(P,  p,p',up )  which  is  a  refinement  of  P  and  satisfies 


$(P')<(p')2,  E(f,  P')  <  A*(p')2. 


Here  A*  is  any  constant  satisfying 


A*  <  min 


A4  jM 
A5  2 A2  j 


(6.13) 


(6.14) 


where  A1,A2,A4  and  A5  are  the  constants  from  (5.5),  (5.7)  and  (6.2),  respectively. 

_ 1  /n 

We  describe  now  the  steps  used  to  produce  P' .  We  take  p"  =  p'A5  7  and  use 
NMMNS  with  the  inputs  P  and  p"  and  up.  The  result  is  a  partition  P3  =  NMMNS(P,  p" ,up) 
which  is  a  refinement  of  P.  We  reapply  NMMNS  with  inputs  //" ,  P, ,  and  the  same  up 
as  the  initial  approximation  to  upi.  We  obtain  the  output  P>  =  NMMNS (P| ,  /i",  up). 
we  repeat  this  k  times  (each  time  using  the  same  up  as  an  initial  approximation)  where 
k  is  chosen  as  the  smallest  integer  so  that  rk  <  (A^-1  (p' / p)2 .  We  now  apply  APPROX 
with  the  input  P^  and  v^4*/d  to  get  the  partition  P'  =  REDUCEfP.  //,  //',  uP). 


Let  us  now  check  that  P'  satisfies  (6.13).  First  of  all,  we  know  that  the  second 
inequality  in  (6.13)  is  valid  because  of  the  APPROX  application.  So  we  need  to  verify 
only  the  first  inequality.  Note  that  (6.14)  implies,  in  particular,  that  A*  <  A4(p" / p)2  so 
that  Theorem  6.3  is  applicable  to  each  iteration.  So  we  have  two  possibilities.  The  first- 
case  is  that  |||-ii  —  «p.  If  <  p"  for  one  of  the  values  of  j  G  {1, . . . ,  A:}.  Since  P'  is  a  refinement 
of  Pj ,  we  have 

A5\\u-uP'f  <  Asl'u-'Up.f  <  A5(p")2  <  (p')2.  (6.15) 
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In  view  of  the  second  relation  in  (6.13)  and  the  condition  (6.14)  on  A *,  we  can  invoke 
Remark  6.1  (with  P  replaced  by  P')  to  conclude  that  the  first  relation  in  (6.13)  is  also 
valid  in  this  case. 

The  other  case  is  that  the  error  is  reduced  by  the  factor  r  at  each  application  of 
NMMNS  which  gives 

A^fu  —  Upi 1 2  <  ^4g [I it  —  upk\ ||2  <  A§Tk |||ii  —  up\\2  <  A5rkA(P)  <  (//)2,  (6.16) 

which,  again  by  Remark  6.1  confirms  the  validity  of  the  first  inequality  in  (6.13). 

The  following  proposition  bounds  the  number  of  subdivisions/computations  used  to 
produce  P' . 

Proposition  6.4  The  number  of  elements  ff{P')  satisfies 

#(P')  <  C3(v/iJ)#(P).  (6.17) 

The  number  ^(REDUCE,  P,  p,p',up)  of  computations  used  to  produce  P'  also  satisfies 

A(REDUCE,  P,  p,  n' ,uP)  <  C3(p,p/p')#(P),  (6.18) 

where  p  is  the  maximal  ratio  A' / 5  for  all  constants  A'  from  (3.6)  encountered  in  the  calls 
of  GAL  invoked  in  REDUCE. 

Proof:  From  Theorem  6.3,  we  have  that  #(Pi)  <  C#(P)  and  ff(Pj+ 1)  <  C#(Pj),  for 
j  =  1, . . . ,  k  —  1.  Since  k  is  fixed,  this  gives  the  bound  (6.17).  Regarding  the  number  of 
computations,  we  know  that  the  number  of  calculations  needed  in  GAL  to  compute  up 
from  tip  does  not  exceed  Cff(P)  with  the  constant  depending  only  on  A  and  the  ratio 
p/ p' .  The  number  of  calculations  in  each  iteration  of  NMMNS  uses  at  most  C(p)ff(Pj) 
calculations  as  was  shown  in  Theorem  6.3.  Therefore,  (6.18)  holds  in  this  case  as  well.  © 


7  The  Main  Loop 


We  are  now  in  a  position  to  build  the  main  iteration  of  our  adaptive  algorithm.  This  loop 
will  use  the  REDUCE  algorithm  to  generate  a  partition  P'  and  then  follow  it  with  a 
coarsening  step  whose  sole  purpose  is  to  give  a  control  on  the  size  of  the  output  partition. 
The  description  of  this  algorithm  will  serve  to  set  the  value  of  the  constants  a,  (3,  5, 7,  A* 
which  have  appeared  earlier  but  were  left  unspecified.  We  call  the  algorithm  for  the  main 
loop  MAIN. 

MAIN  (P,/i,Mr(pj)  — >■  P:  This  algorithm  takes  as  input  a  tolerance  p,  an  admissible 
partition  P ,  and  an  initial  approximation  ur(p)  to  satisfying  (3.6)  and 

i(r(P»  <  p  (7.1) 


and 

P(/,P)<A>2. 


(7.2) 


36 


It  outputs  an  admissible  partition  P  :  =  MAIN  (P,  p,  ur^pfi  which  is  a  refinement  of  P 
and  satisfies 

4(r(P))  <  P/2  (7.3) 

and 

E(f,P)<A*p2/ 2.  (7.4) 

Here  A*  is  the  constant  appearing  in  REDUCE  with  p '  :=  ^fap  and  hence  is  assumed 
to  satisfy 

A’-ami  (?'5) 

We  now  describe  the  steps  in  MAIN. 

STEP1:  Apply  REDUCE  with  inputs  r(P),  p,  and  p'  =  ^Joip  with  a  the  constant  in 
COARSE,  see  (4-42)  and  with  hr(p)  used  as  the  initial  approximation  in  any  application 
of  GAL.  The  output  P'  =  REDUCE  (r(P), //,  y/a//,  ur(p))  is  an  admissible  partition 
which  satisfies  (6.13)  and  in  particular  $(P')  <  ap2. 

STEP2:  Apply  COARSE  with  inputs  P,P',p,  and  up<  :=  ur(p)  (note  that  P'  is  a 
refinement  ofr(P)  and  therefore  ur (p)  is  in  Spi),  and  obtain  the  output 
P  =  COARSE  (P'  ,p,ur(p))  which,  on  account  of  (4.67),  satisfies 

|||«  —  Mr(p)|||2  <  (3p2 .  (7.6) 

STEP  3:  Define  P  to  be  the  common  refinement  of  P  and  P. 

STEP  4:  Apply  APPROX  with  input  P  and  tolerance  {A*  jTfi^p  and  obtain  as  output 
the  partition  P  =  APPROX(P,  [A* /2)1/2 p)  which  satisfies  E(f,P)  <  A*p2/2. 

Let  us  now  check  that  the  conclusions  stated  in  MAIN  are  indeed  valid.  The  condition 
(7.4)  is  an  obvious  consequence  of  STEP  4  in  MAIN.  Furthermore,  (7.5)  enables  the 
application  of  REDUCE  in  STEP1.  So  it  remains  to  verify  (7.3).  The  desired  bound 
on  <f>  will  follow  again  from  Remark  6.1  whose  applicability  will  hinge  on  the  choice 
of  A*  and  of  the  parameter  fj  which  has  yet  to  be  fixed.  By  (7.5)  and  (7.4),  we  have 
(Ai/2A2)^2/2  >  A* p2 / 2  >  E(f,P).  Since  P  is  a  refinement  of  P,  we  have 

Asl'U  —  Mr(p)|2  <  ^4-5  III —  Ur(P)||2  —  ^5/^2,  (7.7) 

where  we  have  used  (7.6)  in  the  last  step.  Thus  whenever  <  1/(2A5),  Remark  6.f 
applies  and  yields  (7.3)  (with  P  used  for  both  P  and  P').  We  impose  this  condition  and 
thereby  specify  the  value  of  (3  as 

P  =  V(2  A5).  (7.8) 

This  in  turn  now  allows  the  specification  of  a  and  7  through  our  earlier  requirements 
(4.66)  and  (4.73). 
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Proposition  7.1  There  is  a  constant  C5  such  that  the  output  P  =  MAIN (P,p,up) 
satisfies 

#(P)  <  C5#(P)  +  C2m(P\P)  (7.9) 

where  m(P\P )  is  the  number  of  markings  used  in  creating  P  from  P.  (Here  C2  is  the 
constant  of  (2.16).)  Also,  the  number  of  flops  JV(MAIN,  P,  fi,  ur(P))  used  in  computing 
P  is  bounded  by  the  right  side  of  (7.9)  times  a  constant  C(p)  that  depends  only  on  the 
maximum  of  the  ratios  p  =  A' / 5  where  A'  measures  the  accuracy  of  the  initial  guess  in 
each  call  of  GAL,  see  (3.6).  In  the  case  that  f  G  AS{H  x)  and  u  G  As  =  AS{H((Q)),  we 
have  the  inequality 

*(P)  <  #{P )  +  C5(s)(||ti||^.V_1/*  +  ll/II^.V17’)  (7.10) 

and 

JV(MAIN,  P,  p,  ur(p))  <  C5(p,s)  {#(P0)  +  (ll/lli7:  +  M\HsS)p-1/s}  (7.11) 

with  Cflp,  s)  a  constant  depending  only  on  s  and  the  maximum  of  the  ratios  A' / 8  corre¬ 
sponding  to  the  accuracies  of  the  initial  guesses  in  the  calls  of  GAL  in  REDUCE  and 
COARSE. 

Proof:  Since  p/ p!  is  fixed,  (6.17)  ensures  that  the  output  P1  of  REDUCE  in  STEP1 
of  MAIN  satisfies 

#(P')  <  C3#(P). 

Of  course,  the  number  of  elements  in  P  is  also  bounded  by  this  number  since  P  C _P'. 
Since  P  is  a  common  refinement  of  P  and  P,  it  also  satisfies  #(P)  <  C#(P).  If  m(P|P)  is 
the  number  of  markings  used  in  the  creation  of  P  from  P  in  the  application  of  APPROX 
in  STEP4,  then  by  (2.16),  we  have 

#(P)  <  #(Po)  +  C2(m(P\P0)  +  m(P\P))  <  C4#(P)  +  C2m(P\P).  (7.12) 

This  proves  (7.9).  Moreover,  by  Remark  3.1  the  number  of  flops  required  by  the  various 
calls  of  GAL  remains  proportional  to  the  cardinalities  of  the  respective  partitions  times 
a  factor  C(p)  that  depends  only  on  the  accuracy  quotients  p  =  A' / 5  relating  the  output 
accuracy  to  that  of  the  initial  guess.  For  each  of  the  algorithms  REDUCE,  COARSE, 
and  APPROX,  we  have  therefore  shown  that  the  number  of  computations  is  bounded 
by  the  maximum  cardinalities  of  the  inputs  and  outputs  of  these  algorithms  times  C(p), 
see  (6.18),  (6.9)  and  (4.68).  Thus  the  right  side  of  (7.9)  multiplied  by  C(p)  also  provides 
an  upper  bound  for  the  number  of  computations  used  in  MAIN. 

Now  suppose  that  /  G  AflH-1)  and  u  G  As.  By  (4.69),  we  have 

#(P)  <  #(Po)  +  C2(s)||u||^V1/s  (7.13) 

and  so 

fl(P)  <  #(P)  +  C2(s)||u||^V1/s.  (7.14) 

When  we  now  apply  STEP4,  we  can  use  (4.34)  to  obtain  the  bound 

m(P\P)  <  CiMII/II^V1''.  (7-15) 
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Hence,  using  (2.16),  we  obtain 


#(4  <  #(P)  +  C2(m(P|P)) 

<  #(P)+Ct(s)(\\u\\1i:ii-1l‘  +  ll/ll  V.v-17')  (7.16) 

where  the  last  inequality  uses  (7.14)  and  (7.15).  This  gives  (7.10).  The  proof  of  (7.11)  is 
similar  but  takes  in  addition  the  cost  (4.68),  (6.5)  and  (6.18)  of  determining  approximate 
Galerkin  solutions  in  REDUCE  and  COARSE.  © 


8  An  adaptive  algorithm  with  coarsening 

In  this  section,  we  shall  formulate  our  adaptive  algorithm  for  solving  (1.1)  and  prove 
the  convergence  properties  of  this  algorithm.  We  shall  call  this  algorithm  ALG.  We 
recall  the  numerical  approximation  <f>(P)  of  <f>(P)  which  is  computed  through  the  aid  of 
the  numerical  approximation  uP  to  Up.  Regarding  the  constraints  on  the  accuracy  of  this 
approximation  reflected  by  the  parameter  5  in  (3.7),  note  that  <)  <  s/a  implies  the  validity 
of  both  conditions  in  (4.66)  and  (4.73)  on  5.  Thus  we  fix 

S2  =  min  {a,  6/(4Bi)}.  (8.1) 

so  that,  in  particular,  B\8 2  <1/4.  If  //  is  the  input  tolerance  in  GAL,  then  from  (6.7), 
we  have 

|$(P)  -  4>(P)|  <  Bx82p2.  (8.2) 


ALG  (e,  P0)  — *  (P,  up):  This  algorithm  takes  as  input  a  desired  tolerance  e  >  0  and  an 
initial  partition  Po  and  outputs  a  partition  P  which  satisfies 

$(P)  <  £2/ 4,  (8.3) 


(and  hence,  by  (5.5),  |«  —  uP\\  <  e/2),  and  it  outputs  a  numerical  approximation  up  to 
up  which  satisfies 

||u  —  up  If  <  e  (8.4) 

by  using  the  following  steps: 

(I):  (Initialization)  For  the  initial  partition  P0,  we  take  Mr(p0)  :=  0  as  the  initial  approx¬ 
imation  of  ur(p0)  and  p  =  1  as  the  tolerance  in  GAL.  We  let  iir(p0)  :=  GAL(r(P0),  1,0) 
be  the  output  of  GAL  for  these  inputs.  We  compute  $(r(Po))  using  iir(p0)  and  we  further 
compute  E(f,  P0) .  We  define 


We  therefore  have 


and  by  (8.2)  and  (8.1) 


£o  :=  max{«S(r(P0))  +  1/4 ,E(f,P0)/A*}. 
E(f,P0)<A*e20 

$(r(P0))  <  <h(r(P0))  +  1/4  <  4 


(8.5) 

(8.6) 

(8.7) 
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If  e o  <  e/2  we  stop  and  output  P  :=  r(Po)  and  up  :=  ur(p0y  If  not,  let  n  be  the  smallest 
integer  such  that  <  e2/4  and  proceed  to  STEP  II. 

(II) :  For  k  =  0, 1, . . . ,  n—  1,  we  apply  MAIN  with  inputs  Pk  and  ek,  satisfying  Q(r(Pk))  < 
el  and  E(f,Pk )  <  A*e2k,  to  obtain  the  partition  Pk+i  =  MAIN(Pfc,  £fc,  iir(pfc)).  In  the 
application  of  MAIN,  we  use  as  an  initial  guess  ur(pk)  :=  iir(p0)  when  k  =  0,  and 
ur(pk)  '■  =  wr(pfe_1)  when  k  >  0,  since  the  latter  function  has  already  been  computed  in 
the  application  of  REDUCE  in  MAIN  at  the  previous  stage.  By  (7.3)  and  (7-4)  we 
know  that  $(r(Pk+1))  <  e|/2,  E(f,Pk+1)  <  A*e\/ 2. 

(III) :  Define  P  =  r(Pn),  and  compute  up  :=  GAL (P,  e,ur(pn_1)) ■ 


The  following  is  the  main  result  of  this  paper. 


Theorem  8.1  For  any  function  f  G  L2(D)  and  any  e  >  0,  ALG  produces  a  partition  P 
for  which 

|ii  —  up\\2  <  $(P)  <  e2/4  (8.8) 

and 

|ii  —  -tip If  <  e  (8.9) 

If  s  >  0  and  u  G  As ,  and  f  G  As ,  then 

*(P)  <  C(s)(#(P„)  +  ll/ll#  +  II «h/.‘)£-1/*  (8.10) 


with  C(s)  >  0  a  constant  depending  only  on  s,  ||n||p'i(Q),  and  the  initial  partition  P0. 
Moreover,  The  number  of  computations  used  in  producing  P  does  not  exceed  C(s)(#(P0)  + 


Proof:  The  conclusion  (8.8)  follows  immediately  from  the  stopping  criterion  and  the 
initialization  (I).  To  show  (8.9),  we  estimate 


II u  —  tip  I  <  ll'U  —  lip  I  +  I'Up  —  tip  I  <  e/2  +  5e  <  e  (8.11) 

because  8  <  1/2  (see  (8.1)). 

We  now  turn  to  bounding  the  number  of  elements  in  P  and  the  number  of  operations 
used  in  the  algorithm  in  terms  of  the  target  accuracy  e.  To  begin  with,  let  us  identify 
next  the  constants  A'  entering  the  estimate  for  the  initial  guesses  in  the  various  calls  of 
GAL.  In  step  (I)  we  have  ur(p0)  =  0  so  that 

iu  -  “r(p0)||2  =  a(u,u)  =  (/, u)  <  ||'ii||p-i(n)||/||n-i(n)  <  Cq1||'h|||/||p--i(o), 
so  that  in  this  case  A1  <  Cq1  ||/||fr-i(n)- 

At  the  kth.  stage  of  step  (II)  GAL  is  invoked  several  times  by  NMMMS  in  REDUCE 
and  by  COARSE.  Each  time  ur(pk)  =  ur(pk)  is  used  as  an  initial  guess.  In  view  of  the 
accuracy  tolerance  of  ur(pk_,)  chosen  in  the  first  call  of  NMMNS  used  in  REDUCE,  we 
have 


til  —  'll. 


W I 


<  I  'll  -  Wr(Pfc_1)  I  +  »r(li  j)  -  Ur(Pk-l)  III 

<  4>(r(Pfc_i))1/2  +  8^l~^-£k-i  <  V2  ^1  +  £k- 
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In  fact,  the  tightest  accuracy  tolerance  used  by  GAL  in  REDUCE,  or  in  COARSE 
is  £k\fa/\/A^.  The  same  argument  applies  for  Step  III.  Thus  in  all  cases  the  ratio  p  = 
A' / 5  is  uniformly  bounded.  By  Proposition  7.1  the  computational  cost  of  GAL  remains 
proportional  to  the  cardinality  of  the  respective  partition. 

To  verify  now  (8.10),  suppose  that  s  >  0  and  u  G  As,  and  /  G  As.  Let  n  be  the  final 
value  of  k  when  the  algorithm  stops,  i.e.,  P  =  r(Pn).  To  start  out  with,  we  note  first  that 
e0  <  C  max{l,  ||m||#i(q),  ||/||l2(o)}.  In  fact,  if  £0  is  given  by  the  second  term  on  the  right 
hand  side  of  (8.5)  we  infer  from  (4.15)  that  £0  <  C1I/||l2(q)-  When  the  first  term  on  the 
right  hand  side  of  (8.5)  dominates  we  distinguish  two  cases.  If  E(f,  P0) / A*  <  $(r(P0))  we 
conclude  from  (6.1),  (6.2)  and  (8.2)  that  £q  —  1/2  =  <f>(r(Po))  — 1/4  <  $(r(Po))  <  A5|||«|||2. 
The  other  case  E(f,  Po) / A *  >  $(r(Po))  >  —  1/2  also  leads  to  the  bound  claimed  above. 

Subsequent  ek  will  satisfy  this  same  bound.  From  (7.16)  we  know  that  for  k  =  1,2, ...  ,n, 
we  have 

*(Pm)  -  #(Pk)  <  CdsMuW^el1'-  +  ll/II^X17*).  (8.12) 

Since  by  definition  £&  >  e2^n~l~kA2  we  have 


#(Pn) 


n— 1 


#(P0)  +  ^  + 

k= 0 


^5(^)£q  1/S 

l  _  2-V2s 


and  since  P  =  r(Pn),  we  have  #(P)  <  C#(Pn)  which  proves  (8.10). 

To  bound  the  number  of  computations  used  in  ALG,  we  know  from  (7.11)  that  the 
number  of  computations  N/.  used  in  computing  P*,,  k  =  1, . . . ,  n,  is  bounded  by 

Nk  <  C5(s)  (#(P0)  +  Hull#  +  \\f\\%s)  (£fc)“1/s  (8.13) 

since,  according  to  our  initial  comments,  the  quotient  p  is  uniformly  bounded  and  therefore 
the  computational  cost  of  each  call  of  GAL  remains  proportional  to  the  corresponding 
partition  size.  Similarly,  the  number  of  computations  used  in  computing  P0  is  bounded 
by  a  constant  multiple  of  #(Po).  Thus  the  total  number  of  computations  does  not  exceed 


CM(#(Po)UI/lb'UMfi.',))5>*r1/‘  <  C(S)(#(Po)+ll/llp+ll«lli/*)E-1/‘.  (8.14) 


-1/s 


k= 0 


© 


9  Appendix 

Recall  from  (4.7)  and  (4.17)  that  the  classes  As(P1(fl))  and  As(P_1(fl))  are  determined 
through  the  nonlinear  approximation  properties  of  their  elements.  For  instance,  the  ele¬ 
ments  in  As(P1(fl))  can  be  approximated  in  IP  (Q)  to  accuracy  £  on  admissible  partitions 
with  the  order  of  e~lP  cells.  Obviously  not  all  of  these  elements  can  be  approximated 
on  uniform  partitions  with  the  same  accuracy.  In  this  section  we  wish  to  explain  (with¬ 
out  proofs)  which  properties  make  a  function  belong  to  As(P1(0)),  say.  This  amounts 
to  relating  the  above  approximation  classes  to  regularity.  The  following  results  about 
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the  behavior  of  <rn(u),  defined  by  (4.3),  are  in  principle  known.  They  invoke  the  Besov 
smoothness  of  functions.  We  refer  the  reader  to  any  of  the  standard  treatments  of  Besov 
spaces  (e.g.[21,  1,  8])  for  the  definition  of  the  Besov  spaces  B*(LT(Q))  and  only  remark 
here  that  such  a  space  is  a  smoothness  space  consisting  of  functions  with  smoothness  order 
s  (number  of  derivatives)  measured  in  Lr.  For  example  RJ(L2)  is  identical  with  Hs  and 
B^Lr)  is  a  Lipschitz  space  in  Lr  whenever  s  is  not  an  integer.  The  role  of  q  is  secondary 
and  only  serves  to  give  a  fine  grading  of  the  spaces  important  in  many  applications  such 
as  embedding  theorems. 

The  results  about  an,  defined  by  (4.3),  can  be  formulated  as  follows. 

Theorem  9.1  If  u  G  Bf+1(LT(fl))  with  0  <  a  <  1  and  1/r  <  (a  +  l)/2,  then 
an{u)  =  inf  inf  \\u  —  S'!  <  Con~a^2\\u\\Ba+i,L 

PeVnSeSp  T  y  T>> 

where  the  constant  Co  depends  on  the  discrepancy  5  :=  _  I  when  5  tends  to  zero. 

A  few  brief  comments  on  the  range  of  the  involved  parameters  are  in  order. 

Remark  9.2  The  restriction  on  a  arises  because  we  are  approximating  in  H 1  using  piece- 
wise  linears  and  so  a  +  1  <  2.  Thus  in  two  spatial  dimensions  IV-1/2  is  the  highest  at¬ 
tainable  order  in  the  class  B((LT ))  with  r_1  <  1.  The  restriction  on  r  arises  from  the 
Sobolev  embedding  theorem.  It  guarantees  that  the  Besov  space  is  embedded  compactly  in 
H1 .  When  r_1  >  (a  +  l)/2,  this  Besov  space  is  no  longer  embedded  in  H1 . 

The  above  regularity  assumptions  are  only  sufficient  for  u  to  belong  to  A0:,/2 ,  say.  The 
following  inverse  theorem  shows  that,  although  this  is  not  a  complete  characterization,  it 
is  sharp  in  the  following  sense. 

Theorem  9.3  If  u  G  iJ1(fl)  satisfies  crn(u )  <  Cn~a /2  then  u  G  Bf+1(LT)  for  all  r 
satisfying  r_1  =  (a  +  l)/2. 

The  proofs  of  Theorems  9.1,  9.3  can  be  found  in  [3].  For  related  results  see  [14], 
Finally,  as  a  consequence  of  known  results  on  the  metric  entropy  of  unit  balls  of  Besov 
classes,  the  order  of  Besov  smoothness  limits  the  approximation  order  in  the  following 
sense. 

Theorem  9.4  For  each  0  <  a  <  1  and  r_1  <  (a  +  l)/2  we  have 

sup  (in{u)  >  CN~a/ 2  (9.1) 

/e(7(B?+1(LT(Q))) 

with  C  >  0  an  absolute  constant.  (Here  U(X )  denotes  the  unit  ball  of  a  normed  space 

X). 

The  above  results  should  be  read  as  follows.  First  note  that  for  fixed  [3  the  smoothness 
measure  given  by  the  space  B^(LT))  becomes  weaker  when  r  decreases.  Theorem  9.1  says 
that  for  fixed  a  the  loss  of  regularity  incurred  by  decreasing  r  towards  the  critical  value 
2/(«  +  1)  can  be  compensated  by  nonlinear  approximation  so  as  to  retain  the  order  N~a /2 
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of  approximation  in  H1.  Theorem  9.4  says  that  this  order  is  best  possible  with  respect 
to  the  full  unit  ball  in  the  respective  Besov  class.  To  obtain  the  same  approximation 
order  through  spaces  on  (quasi-)  uniform  meshes  is  equivalent  to  u  belonging  in  the  much 
smaller  space  B^~1(L2)  (which  is  close  to  Ha+1). 

Thus  whenever  the  solution  u  has  sufficient  regularity  measured  in  L2  the  best  pos¬ 
sible  balance  of  accuracy  versus  the  number  of  degrees  of  freedom  can  be  obtained  at 
least  asymptotically  by  using  quasi-uniform  meshes.  Nonlinear  approximation  provides 
asymptotically  better  rates  whenever  u  has  a  higher  regularity  in  Lr  for  r  <  2.  Theorem 
8.1  combined  with  Theorem  9.1  then  say  that  this  better  rate  is  actually  recovered  by 
the  above  adaptive  algorithm  which  is  a  special  instance  of  a  nonlinear  process.  It  has 
recently  been  shown  in  [10,  9]  that,  depending  on  the  smoothness  of  the  domain  Q,  the 
solution  to  Poisson’s  equation  indeed  has  typically  higher  Besov  than  Sobolev  regularity 
in  the  sense  that 

a*  :  =  sup  {a  :  u  G  B"+1(Lr),  r_1  =  (ct  +  l)/2}  >  (3*  :=  sup  {/?  :  u  <E  B^1(L2)}. 

Hence  in  those  cases  the  use  of  the  adaptive  scheme  gives  a  better  asymptotic  work/accuracy 
rate. 

Acknowledgement:  We  are  very  indebted  to  P.  Morin  and  R.  Nochetto  for  valuable 
comments  and  for  pointing  out  to  us  an  erroneous  argument  in  an  earlier  version  of  this 
paper. 
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